Open source voice AI platform. Self-hosted alternative to Vapi and Retell. On Prem, BYOK across Speech to Speech or LLM/STT/TTS, with a visual workflow builder, MCP native and telephony support.
-
Updated
Jun 19, 2026 - Python
Open source voice AI platform. Self-hosted alternative to Vapi and Retell. On Prem, BYOK across Speech to Speech or LLM/STT/TTS, with a visual workflow builder, MCP native and telephony support.
Conversational voice AI agents
Rapida is an open-source, end-to-end voice AI orchestration platform for building real-time conversational voice agents with audio streaming, STT, TTS, VAD, multi-channel integration, agent state management, and observability.
Build realtime voice and video agents with Google's new Gemini 2.0 (API is free for now)
Set of 📝 with 🔗 to help those building Voice AI agents 🎙️🤖
Official one-stop shop for AI Agents and developers building with Telnyx.
DialogLab is an authoring tool for configuring and running Human-AI multi-party conversations.
Sayna is a unified Voice Layer for AI Agents with a seemless integration to an existing agentic frameworks
A New End-to-end Framework for Evaluating Voice Agents
A self-improving loop for voice AI agents. Uses karpathy's autoresearch as foundation.
The ElevenLabs Agents SDK for TypeScript.
A programmable voice platform: SIP and WebRTC call control, multi-party mixing, recording, TTS/STT, and pluggable AI agents (ElevenLabs, VAPI, Pipecat, Deepgram) — all driven through a REST API, webhooks, and a WebSocket event stream
Open-Source CPAAS for Contact Center Teams.
Voice Prompts, GPT-4o prompts, Voice Agent Prompts, ChatGPT Prompts, HumeAI Prompts
A real-time voice/call AI agent that lets you talk to a LangGraph agent over LiveKit — similar to "voice mode" experiences in ChatGPT Voice, OpenAI Realtime API sessions, and Gemini Live. This repo demonstrates adapting any LangGraph agent into a full-duplex, low-latency voice assistant using LiveKit Agents.
From-scratch voice agents in Python: end-to-end speech pipelines, runnable chapters, and a small shared library. Local models, explicit streaming behavior.
Self-hosted native-Rust runtime for real-time voice agents. Own the stack: one binary in your own VPC or air-gapped, no hosted control plane. pipecat-compatible pipeline, in-process SIP/RTP, single-process call density. Apache-2.0.
A curated list of voice AI agent frameworks, tools, resources, and best practices
Add a description, image, and links to the voice-agents topic page so that developers can more easily learn about it.
To associate your repository with the voice-agents topic, visit your repo's landing page and select "manage topics."