This episode is sponsored by Thuma.
Thuma is a modern design company that specializes in timeless home essentials that are mindfully made with premium materials and intentional details.
To get $100 towards your first bed purchase, go to http://thuma.co/eyeonai
—————————————————————————————————————————
AI deployment is broken—can it be fixed? In this episode, Tuhin Srivatsa, CEO & Co-Founder of Baseten, reveals how his company is DISRUPTING AI infrastructure, making it easier, faster, and more cost-effective to deploy and scale AI models in production.
As enterprises increasingly turn to open-source AI models and grapple with the high costs and complexity of scaling, Baseten offers a game-changing solution that eliminates bottlenecks and simplifies the process. Discover how Baseten is taking on AWS SageMaker, OpenAI, and cloud-based AI deployment platforms to reshape the future of AI model deployment.
What You’ll Learn in This Episode:
-
Why AI deployment & scaling is one of the biggest challenges in 2025
-
How Baseten enables enterprises to run AI models faster & more efficiently
-
The shift from closed-source to open-source AI models—and why it matters
-
The hidden costs of AI inference & how to optimize for performance
-
Why most AI models fail in production and how to prevent it
-
The future of AI infrastructure: What comes next for scalable AI
Whether you’re a machine learning engineer, AI researcher, startup founder, or enterprise leader, this episode is packed with actionable insights to help you scale AI models without the headaches.
Don’t miss this conversation on the next era of AI deployment!
#AI #ArtificialIntelligence #MachineLearning #Baseten #AIDeployment #AIScaling #Inference #MLInfrastructure #TechPodcast
Stay Updated:
Craig Smith Twitter: https://twitter.com/craigss
Eye on A.I. Twitter: https://twitter.com/EyeOn_AI
—————————————————————————————————————————
(00:00) Tuhin Srivatsa’s Journey in AI & Baseten
(01:50) What is AI Infrastructure & Why It Matters
(03:30) How Baseten Optimizes AI Model Deployment
(05:19) Why Most AI Deployments Fail (And How to Fix It)
(09:17) The Future of Open-Source AI Models in Enterprise
(11:01) How Baseten Automates AI Scaling & Inference
(14:12) Why AI Developers Struggle with Cloud-Based AI Tools
(18:47) The Real Cost of AI Inference (And How to Reduce It)
(20:44) Why AI Scaling is the Biggest Challenge in 2025
(26:55) Can AI Run on Non-NVIDIA Chips? (The Hardware Debate)
(31:23) The Future of AI Model Deployment & Inference
(37:05) How AI Agents & Reasoning Models Are Changing the Game
(40:39) The Truth About AI Hype vs. Reality
(45:04) How to Get Started with Baseten
(45:48) The Future of AI Infrastructure
Information
- Show
- FrequencyUpdated Weekly
- PublishedFebruary 26, 2025 at 2:00 PM UTC
- Length46 min
- RatingClean