Chain of Thought

Galileo
Chain of Thought

Introducing Chain of Thought, the podcast for software engineers and leaders that demystifies artificial intelligence. Join us each week as we tell the stories of the people building the AI revolution, unravel actionable strategies and share practical techniques for building effective GenerativeAI applications.

  1. 1 DAY AGO

    The Making of Gemini 2.0: DeepMind's Approach to AI Development and Deployment | Logan Kilpatrick

    Google’s strength in AI has often seemed to get lost in the midst of OpenAI announcements or DeepSeek fervor - yet Gemini 2.0 is more than good for many tasks; it’s the model to beat - and we have the research to back it up.  This week, Logan Kilpatrick, senior product manager at Google DeepMind, joins us to discuss Gemini’s creation story, its emergence as the premiere model in the AI race, and why the launch of Gemini 2.0 is great news for developers. During the conversation Conor and Logan explore the exciting world of multimodal AI, Gemini's strengths in agentic use cases, and its unique approach to function calling, compositional function calling, and the seamless integration of tools like search and code execution. They also chat about Logan’s vision for a future where AI interacts with the world more naturally, offering a view of the potential of vision-first AI agents, and why Google's hardware advantage is enabling Gemini's impressive performance and long context capabilities.  Follow along with the discussion using Galileo’s AI Agent Leaderboard:https://huggingface.co/spaces/galileo-ai/agent-leaderboard Chapters:00:00 DeepMind's Role in Gemini's Development 03:49 Gemini 2.0 Updates and Developer Highlights 06:08 Agentic Use Cases and Function Calling 11:29 Multimodal Capabilities 16:15 Putting AI in Production 21:06 Gemini's Differentiation and Hardware 31:22 Future Vision for Gemini and G Suite Integration 35:23 Gemini for Developers 39:02 Conclusion and FarewellFollow the hostsFollow⁠⁠⁠Atin⁠⁠⁠Follow⁠⁠⁠Conor⁠⁠ Follow⁠Vikram⁠Follow⁠⁠⁠Yash⁠⁠⁠ Follow Logan Twitter:@OfficialLoganK LinkedIn:https://www.linkedin.com/in/logankilpatrick/ Show Notes Try Gemini for yourself:gemini.google.com Gemini for Developers:aistudio.google.com Check out Galileo ⁠⁠Try Galileo⁠⁠

    41 min
  2. JAN 15

    AI in 2025: Agents & The Rise of Evaluation Driven Development

    "In the next three to five years, every piece of software that is built on this planet will have some sort of AI baked into it." - Atin Sanyal Chain of Thought is back for its second season, and this episode dives headfirst into the possibilities AI holds for 2025 and beyond. Join Conor Bronson as he chats with Galileo co-founders Yash Sheth (COO) and Atindriyo Sanyal (CTO) about major trends to look for this year. These include AI finding its product "tool stack" fit, generation latency decreasing, AI agents, their potential to revolutionize code generation and other industries, and the crucial role of robust evaluation tools in ensuring the responsible and effective deployment of these agents. Yash and Atin also highlight Galileo's focus on building trust and security in AI applications through scalable evaluation intelligence. They emphasize the importance of quantifying application behavior, enforcing metrics in production, and adapting to the evolving needs of AI development. Finally, they discuss Galileo's vision for the future and their active pursuit of partnerships in 2025 to contribute to a more reliable and trustworthy AI ecosystem. Chapters: 00:00 AI Trends and Predictions for 2025 02:55 Advancements in LLMs and Code Generation 05:16 Challenges and Opportunities in AI Development 10:40 Evaluating AI Agents and Applications 16:07 Building Evaluation Intelligence 23:41 Research Opportunities 29:50 Advice for Leveraging AI in 2025 32:00 Closing Remarks Show Notes: Check out Galileo⁠⁠⁠⁠⁠⁠⁠⁠⁠ Follow Yash Follow Atin Follow Conor

    33 min
  3. 12/18/2024

    How AI Assistants Can Enhance Human Connection | Twilio’s Vinnie Giarrusso

    Can AI assistants actually enhance human connection? As Season 1 of Chain of Thought comes to a close, Conor Bronsdon and Vinnie Giarrusso (Twilio) explore the transformative potential of AI assistants in the workplace. Discover how these assistants function as "async junior digital employees," taking on specific tasks and contributing to the organizational structure. But will AI assistants ultimately replace human connection? Vinnie argues the opposite is true, suggesting that AI can liberate employees from mundane tasks, allowing them to focus on building meaningful relationships and providing personalized experiences. This thought-provoking conversation takes a philosophical turn as Vinnie explores how AI could revolutionize education while potentially disrupting traditional mentorship roles. He shares his vision for a future where AI democratizes information and empowers individuals to personalize their learning journey. Finally, learn how Twilio and Galileo are partnering to shape the future of AI and what this collaboration means for both companies. Chain of Thought will be taking a break for the holidays, but we'll see you back here on January 8th for the start of Season 2! Chapters: 00:00 Twilio's AI Agent Platform 06:34 Ensuring Accuracy and Trustworthiness 09:49 Challenges and Failure Modes 17:39 Future of Fully Autonomous Agents 22:18 Human-AI Collaboration and Mentorship 31:24 Education and Democratization of Information 32:58 Partnership with Galileo 39:54 Conclusion and Season Wrap-Up Follow: Conor Bronsdon: https://www.linkedin.com/in/conorbronsdon/ Vinnie Giarrusso: https://www.linkedin.com/in/vinniegiarrusso/ Show notes: Twilio Alpha: ⁠https://twilioalpha.com OWASP GenAI: https://genai.owasp.org

    42 min
  4. 12/11/2024

    Lessons from Deploying AI at Enterprise Scale | ServiceTitan, Indeed & Twilio

    This week, a panel of experts (Mehmet Murat Ezbiderli, ServiceTitan; Grant Ledford, Indeed; and Vinnie Giarrusso, Twilio) join Atin Sanyal (CTO, Galileo) and Conor Bronsdon (Developer Awareness, Galileo) to explore the challenges and opportunities of deploying GenAI at enterprise scale in a conversation that's a wake-up call for any business leader looking to harness the power of AI. Together, Atin & Conor break down key considerations like performance, cost, and model selection, emphasizing the need for robust evaluation frameworks and a shift in developer mindset. Atin then sits down with our panel of AI engineering experts to discuss their firsthand experiences with enterprise AI, including the trade-offs of building AI systems, the evolving tools and frameworks available, and the impact these technologies are having on their organizations. Chapters: 00:00 Enterprise Scale Deployment 05:17 Cost, Performance, and Model Selection 08:59 Building and Integrating GenAI Systems 15:26 Emerging Enterprise Use Cases 18:12 Predictions for AI in 2025 27:28 Panel Discussion: Deploying AI at Enterprise Scale 31:19 Gen AI Solutions and Challenges 33:12 Building & Deploying Traditional Infrastructure vs GenAI Infrastructure 34:36 How to Assemble Your GenAI Stack 40:39 Today's Best GenAI Use Cases 48:15 Enterprise AI Trends for 2025 50:36 Closing Remarks and Future Outlook Follow: Atin Sanyal: ⁠⁠⁠https://www.linkedin.com/in/atinsanyal/⁠ Mehmet Murat Ezbiderli: https://www.linkedin.com/in/mehmet-murat-ezbiderli-b894a49/ Grant Ledford: https://www.linkedin.com/in/grant-ledford-36b146a5/ Vinnie Giarrusso: https://www.linkedin.com/in/vinniegiarrusso/ Show notes: Watch all of Productionize: https://www.galileo.ai/genai-productionize-2-0

    51 min
  5. 12/04/2024

    Practical Lessons for GenAI Evals | Chip Huyen & Vivienne Zhang

    As AI agents and multimodal models become more prevalent, understanding how to evaluate GenAI is no longer optional – it's essential.  Generative AI introduces new complexities in assessment compared to traditional software, and this week on Chain of Thought we’re joined by Chip Huyen (Storyteller, Tép Studio), Vivienne Zhang (Senior Product Manager, Generative AI Software, Nvidia) for a discussion on AI evaluation best practices.  Before we hear from our guests, Vikram Chatterji (CEO, Galileo) and Conor Bronsdon (Developer Awareness, Galileo) give their takes on the complexities of AI evals and how to overcome them through the use of objective criteria in evaluating open-ended tasks, the role of hallucinations in AI models, and the importance of human-in-the-loop systems. Afterwards, Chip and Vivienne sit down with Atin Sanyal (Co-Founder & CTO, Galileo) to explore common evaluation approaches, best practices for building frameworks, and implementation lessons. They also discuss the nuances of evaluating AI coding assistants and agentic systems. Chapters: 00:00 Challenges in Evaluating Generative AI 05:45 Evaluating AI Agents 13:08 Are Hallucinations Bad? 17:12 Human in the Loop Systems 20:49 Panel discussion begins 22:57 Challenges in Evaluating Intelligent Systems 24:37 User Feedback and Iterative Improvement 26:47 Post-Deployment Evaluations and Common Mistakes 28:52 Hallucinations in AI: Definitions and Challenges 34:17 Evaluating AI Coding Assistants 38:15 Agentic Systems: Use Cases and Evaluations 43:00 Trends in AI Models and Hardware 45:42 Future of AI in Enterprises 47:16 Conclusion and Final Thoughts Follow: Vikram Chatterji: https://www.linkedin.com/in/vikram-chatterji/ Atin Sanyal: ⁠⁠https://www.linkedin.com/in/atinsanyal/ Conor Bronsdon: https://www.linkedin.com/in/conorbronsdon/ Chip Huyen: ⁠https://www.linkedin.com/in/chiphuyen/⁠ Vivienne Zhang: ⁠⁠https://www.linkedin.com/in/viviennejiaozhang/ Show notes: Watch all of Productionize 2.0: ⁠https://www.galileo.ai/genai-productionize-2-0⁠

    48 min
5
out of 5
6 Ratings

About

Introducing Chain of Thought, the podcast for software engineers and leaders that demystifies artificial intelligence. Join us each week as we tell the stories of the people building the AI revolution, unravel actionable strategies and share practical techniques for building effective GenerativeAI applications.

You Might Also Like

To listen to explicit episodes, sign in.

Stay up to date with this show

Sign in or sign up to follow shows, save episodes, and get the latest updates.

Select a country or region

Africa, Middle East, and India

Asia Pacific

Europe

Latin America and the Caribbean

The United States and Canada