ODSC's Ai X Podcast

Inside Google’s New AI Stack with Paige Bailey

In this episode of the ODSC Ai X Podcast, host Sheamus McGovern reconnects with Paige Bailey, Engineering Lead at Google DeepMind for the Developer Experience team. Paige shares how the Gemini ecosystem has evolved since her last appearance, including the launch of Gemini 2.5 Deep Think, multimodal video generation with Veo 3, real-time music creation with Lyria RT, and advances in agentic and on-device AI systems. The conversation explores the rapid rise of agent-based workflows, AI-powered robotics, and the growing divide between cutting-edge tools and real-world adoption.

Key Topics Covered:

- Gemini 2.5 Deep Think & Reasoning Models
- The model that won gold at the International Mathematical Olympiad (IMO)
- Use cases for the Deep Think, Pro, Flash, and Flash-Lite variants
- Using the Gemini Live API for real-world robotics and decision planning
- Role of multimodal inputs (video, audio, text) in enabling embodied AI
- On-Device AI & Ubiquity
- Implications for edge deployment, cost reduction, and accessibility
- Veo 3: Multimodal Video Generation
- Lyria RT: Real-Time Music Generation
- Gemini Live API & Voice Interfaces
- Real-time bidirectional voice, screen understanding, and tool calling
- Rise of voice as the dominant AI interface
- Use of SynthID and digital watermarking to detect deepfakes
- Future of AI-agent orchestration via MCP servers

Memorable Outtakes:

- On the pace of model development: “A 4-billion parameter model on-device now outperforms our best cloud model from six months ago. That’s pretty magical.” (Paige Bailey)
- On the role of AI agents in robotics: “You can say, ‘Hey robot, go get me that apple,’ and Gemini will plan the task, route it, and call the right control models.” (Paige Bailey)
- On the AI adoption gap: “In the Bay Area, we use AI hourly. But when I talk to developers in the Midwest, they often aren’t using it at all.” (Paige Bailey)

References & Resources:

Paige Bailey
- Dynamic Web Paige: https://webpaige.dev/
- LinkedIn: https://www.linkedin.com/in/dynamicwebpaige
- GitHub: https://github.com/dynamicwebpaige
- Medium: https://medium.com/@dynamicwebpaige
- Previous podcast with Paige: https://podcasters.spotify.com/pod/show/ai-x-podcast/episodes/Googles-AI-Powered-Tools-for-Data-Scientists-Building-the-Automated-Future-of-Data-Science-with-Paige-Bailey-e2p3t6e

Resources mentioned
- International Mathematical Olympiad (IMO): https://www.imo-official.org
- Model Context Protocol (MCP): https://modelcontextprotocol.io/docs/getting-started/intro
- Gemini 2.5 Deep Think: https://blog.google/products/gemini/gemini-2-5-deep-think/
- Veo 3: https://deepmind.google/technologies/veo/
- Lyria RT & Music AI Sandbox: https://deepmind.google/technologies/lyria/
- SynthID & Deepfake Watermarking: https://deepmind.google/technologies/synthid/
- Gemma Models: https://ai.google.dev/gemma
- Gemini Live API Docs: https://ai.google.dev/gemini-api/docs/live
- Google AI Studio: https://ai.google.dev

Sponsored by:

🔥 ODSC AI West 2025 – The Leading AI Training Conference
Join us in San Francisco from October 28th–30th for expert-led sessions on generative AI, LLMOps, and AI-driven automation. Use the code "podcast" for 10% off any ticket. Learn more: https://odsc.ai