This project is the implementation for Demospace's AI agent, which interacts with a user via voice and sends visual assets in real time. It uses LiveKit for real-time streaming over WebRTC and links multiple ML models (Deepgram, Claude/OpenAI, Eleven Labs) to power the voice-to-voice workflow.
Note that to use this agent, you'll need to have the Demospace frontend application running as well, connected to your LiveKit server.
- Real-time voice interaction with AI agent
- Integration with LiveKit for WebRTC streaming
- Utilizes multiple machine learning models for voice processing
- Python 3.11 or higher
- Poetry 1.2 or higher
- LiveKit server setup
- API keys for Deepgram, Anthropic, and Eleven Labs
-
Clone the repository:
git clone https://github.com/yourusername/your-repo-name.git cd your-repo-name
-
Install the required Python packages:
poetry install
- Start the LiveKit server and obtain the necessary credentials.
- Run the main script:
poetry run python main.py start
main.py
: Entry point of the application.livekit/
: Contains modules related to LiveKit integration.agents/
: Contains AI agent-related modules and functions.models/
: Contains machine learning models used for voice processing.
Contributions are welcome! Please open an issue or submit a pull request for any changes.
- LiveKit for providing the real-time streaming infrastructure.
- All contributors and maintainers of the project.