This project shows how to prototype a real-time voice AI Android app using Gemini 2.0’s Live API over WebSockets as an open-source proof of concept before committing to full production infrastructure. By combining low-level audio control on Android, duplex audio streaming, and multimodal AI, we built an

If your voice AI system can touch real systems or trigger actions with business consequences, your approach to AI agent tool calling security matters. When voice AI agents can modify customer data, trigger escalations, update ticketing systems, or execute workflows—especially for customer service in regulated industries like

The choice between conversation-based and turn-based Voice AI agent patterns is a strategic business decision, not just a technical detail. Beyond what your agent will say, you must decide how it will run. This architectural choice defines how your voicebot will scale, what it will cost to

Many WebRTC applications struggle with outdated or inappropriate media server infrastructure, limiting their ability to scale effectively and support powerful AI features. Alfred Gonzalez, Senior WebRTC Engineer at WebRTC.ventures, walks us through the considerations, options, and steps to successfully migrate to another media server. He’ll then show
