voice-agent

Real-Time AI Voice Agent
A real-time voice agent built using LiveKit Agents, AssemblyAI for Speech-to-Text (STT), Cartesia for Text-to-Speech (TTS), Mistral / OpenAI LLMs, and Silero Voice Activity Detection (VAD).

This project enables real-time, bidirectional voice interactions with AI models by combining streaming audio processing with large language models, allowing practical use cases such as conversational assistants, voice-driven applications, and real-time natural language dialogue systems.

🧠 Features

Live voice processing pipeline
- Continuous speech capture and decoding
- Real-time agent responses
Speech-to-Text (STT)
- Powered by AssemblyAI (or other ASR systems)
Text-to-Speech (TTS)
- Cartesia or other configurable TTS providers
Voice Activity Detection (VAD)
- Silero VAD ensures efficient capture and reduces noise
Large Language Model integration
- Mistral / OpenAI models for reasoning and dialogue
Pluggable architecture
- Modular audio, model, and transport layers
Test suite
- Simple import tests included

🎯 Typical Use Cases

Conversational voice assistants
Interactive voice experiences
Real-time chat with AI using voice
Accessibility tools (hands-free interfaces)
Rapid prototyping for voice-enabled agents

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.env		.env
.gitattributes		.gitattributes
.gitignore		.gitignore
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
NOTICE		NOTICE
README.md		README.md
edge_tts_plugin.py		edge_tts_plugin.py
myagent.py		myagent.py
pyproject.toml		pyproject.toml
services.py		services.py
test_imports.py		test_imports.py
test_imports_2.py		test_imports_2.py
test_output.txt		test_output.txt
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

voice-agent

🧠 Features

🎯 Typical Use Cases

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

voice-agent

🧠 Features

🎯 Typical Use Cases

About

Resources

License

Code of conduct

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages