In this episode of the ODSC AiX Podcast, host Sheamus McGovern reconnects with Paige Bailey, Engineering Lead at Google DeepMind for the Developer Experience team. Paige shares how the Gemini ecosystem has evolved since her last appearance, including the launch of Gemini 2.5 DeepThink, multimodal video generation with Veo 3, real-time music creation with Lyria RT, and groundbreaking advances in agentic and on-device AI systems. The conversation explores the rapid rise of agent-based workflows, AI-powered robotics, and the growing divide between cutting-edge tools and real-world adoption.
Key Topics Covered:
Gemini 2.5 DeepThink & Reasoning Models
The model that won gold at the International Mathematical Olympiad (IMO)
Use cases for DeepThink, Pro, Flash, and FlashLite variants
Using Gemini Live API for real-world robotics and decision planning
Role of multimodal inputs (video, audio, text) in enabling embodied AI
On-Device AI & Ubiquity
Implications for edge deployment, cost reduction, and accessibility
Veo 3: Multimodal Video Generation
Lyria RT: Real-Time Music Generation
Gemini Live API & Voice Interfaces
Real-time bidirectional voice, screen understanding, and tool calling
Rise of voice as the dominant AI interface
Use of SynthID and digital watermarking to detect deepfakes
Future of AI-agent orchestration via MCP servers
Memorable Outtakes:
On the pace of model development: “A 4-billion parameter model on-device now outperforms our best cloud model from six months ago. That’s pretty magical.” — Paige Bailey
On the role of AI agents in robotics: “You can say, ‘Hey robot, go get me that apple,’ and Gemini will plan the task, route it, and call the right control models.” — Paige Bailey
On the AI adoption gap: “In the Bay Area, we use AI hourly. But when I talk to developers in the Midwest, they often aren’t using it at all.” — Paige Bailey
References & Resources:
Paige Bailey
Dynamic Web Paige: https://webpaige.dev/
LinkedIn: https://www.linkedin.com/in/dynamicwebpaige
GitHub: https://github.com/dynamicwebpaige
Medium: https://medium.com/@dynamicwebpaige
Previous podcast with Paige: https://podcasters.spotify.com/pod/show/ai-x-podcast/episodes/Googles-AI-Powered-Tools-for-Data-Scientists-Building-the-Automated-Future-of-Data-Science-with-Paige-Bailey-e2p3t6e
Resources mentioned
International Mathematical Olympiad (IMO): https://www.imo-official.org
Model Context Protocol (MCP): https://modelcontextprotocol.io/docs/getting-started/intro
Gemini 2.5 Deep Think: https://blog.google/products/gemini/gemini-2-5-deep-think/
Veo 3: https://deepmind.google/technologies/veo/
Lyria RT & Music AI Sandbox: https://deepmind.google/technologies/lyria/
SynthID & Deepfake Watermarking: https://deepmind.google/technologies/synthid/
Gemma Models: https://ai.google.dev/gemma
Gemini Live API Docs: https://ai.google.dev/gemini-api/docs/live
Google AI Studio: https://ai.google.dev
Sponsored by:
🔥 ODSC AI West 2025 – The Leading AI Training Conference
Join us in San Francisco from October 28th–30th for expert-led sessions on generative AI, LLMOps, and AI-driven automation.
Use the code podcast for 10% off any ticket.
Learn more: https://odsc.ai
Information
- Show
- FrequencyUpdated Weekly
- Published5 September 2025 at 4:00 am UTC
- Season1
- Episode81