
This project shows how to prototype a real-time voice AI Android app using Gemini 2.0’s Live API over WebSockets as an open-source proof of concept before committing to full production infrastructure. By combining low-level audio control on Android, duplex audio streaming, and multimodal AI, we built an

As WebRTC developers, we’ve gotten very good at moving real-time media around the globe. But often, the most exciting and valuable work happens when we stop just routing media and add some processing to it. The challenge is that building custom, real-time media processing workflows is often

The era of clunky, keypad-driven legacy IVR customer service systems that have long frustrated users is finally over. The future of Interactive Voice Response is truly conversational, and it’s ready for prime time. That’s why Deepgram’s State of Voice AI 2025 report says 84% of business leaders