The Data Stack Show

Rudderstack

Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

  1. 264: Infrastructure as Code Meets AI: Simplifying Complexity in the Cloud with Alexander Patrushev of Nebius

    8 HR AGO

    264: Infrastructure as Code Meets AI: Simplifying Complexity in the Cloud with Alexander Patrushev of Nebius

    This week on The Data Stack Show, Alexander Patrushev joins John to share his journey from working on mainframes at IBM to leading AI infrastructure innovation at Nebius, with stops at VMware and AWS along the way. The discussion explores the evolution of AI and cloud infrastructure, the five pillars of successful machine learning projects, and the unique challenges of building and operating modern AI data centers—including energy consumption, cooling, and networking. Alexander also delves into the practicalities of infrastructure as code, the importance of data quality, and offers actionable advice for those looking to break into the AI field. Key takeaways include the need for strong data foundations, thoughtful project selection, and the value of leveraging existing skills and tools to succeed in the rapidly evolving AI landscape. Don’t miss this great conversation. Highlights from this week’s conversation include: Alexander’s Background and Early Career at IBM (1:06)Moving From Mainframes to Virtualization at VMware (4:09)Transitioning to AWS and Machine Learning Projects (8:22)What Was Missed From Mainframes and the Rise of Public Cloud (9:03)Security, Performance, and Economics in Cloud Infrastructure (12:40)The Five Pillars of Successful Machine Learning Projects (15:02)Choosing the Right ML Project: Data, Impact, and Existing Solutions (18:01)Real-World AI and ML Use Cases Across Industries (19:42)Building Specialized AI Clouds Versus Hyperscalers (22:08)Performance, Scalability, and Reliability in AI Infrastructure (25:18)Data Center Energy Consumption and Power Challenges (28:41)Cooling, Networking, and Supporting Systems in AI Data Centers (30:06)Infrastructure as Code and Tooling in AI (31:50)Lowering Complexity for AI Developers and the Role of Abstraction (34:08)Startup Opportunities in the AI Stack (38:53)When to Fine-Tune or Post-Train Foundation Models (43:41)Comparing and Testing Models With Tool Use (47:49)Skills and Advice for Entering the AI Field (49:18)Final Thoughts and Encouragement for AI Newcomers (52:31)The Data Stack Show is a weekly podcast powered by RudderStack, customer data infrastructure that enables you to deliver real-time customer event data everywhere it’s needed to power smarter decisions and better customer experiences. Each week, we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data. RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com. Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.

    53 min
  2. 263: The End of Busywork: How AI Transforms Productivity at Scale with Alberto Rizzoli of V7

    24 SEPT

    263: The End of Busywork: How AI Transforms Productivity at Scale with Alberto Rizzoli of V7

    This week on The Data Stack Show, AI entrepreneur Alberto Rizzoli shares his journey from early computer vision breakthroughs to leading the automation of back-office workflows at V7. The discussion explores the shift from bespoke model training to configurable AI solutions, the impact of automation on business roles, and emerging best practices for integrating AI into enterprises. Listeners will gain insight into how AI infrastructure is moving from labs to everyday businesses, which roles are most vulnerable or secure amid automation, and why future-proofing your career means focusing on creativity, first principles, and continuous improvement. Don’t miss it!  Highlights from this week’s conversation include: Setting the Stage: AI’s Hype and Today’s Innovations (1:16)Alberto’s Non-Tech Passions: Physics & UX (4:04)The Paradigm Shift: Machines that Adapt (6:22)Scaling AI: From Niche Apps to Mainstream Use (8:23)Large Models vs. Bespoke Solutions—Power Law in AI (11:07)Evolving Roles: From Engineer to End User (14:14)Simple vs. Complex AI Implementations (18:14)When to Scale from Simple AI to Production-Grade (22:40)Capturing Tacit Knowledge: Crowdsourcing vs. Centralization (27:22)The Challenge of Unstructured Process Documentation (30:08)Practical Impact: AI-Enabled Enterprise Leverage (33:29)ROI: Time Saved & Compound Effects in the Enterprise (38:08)Redefining Information Movers vs. Information Creators (44:14)Roles at Risk and the Case for Creativity (45:30)Alberto's Favorite New Tech & Future of User Experience (46:00)Spreadsheets, Business Logic, and AI’s Next Leap (48:41)The Data Stack Show is a weekly podcast powered by RudderStack, customer data infrastructure that enables you to deliver real-time customer event data everywhere it’s needed to power smarter decisions and better customer experiences. Each week, we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data. RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com. Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.

    50 min
  3. 262: From PDFs to BI and Beyond: The Future of the Data Frontend with Ryan Dolley of GoodData

    17 SEPT

    262: From PDFs to BI and Beyond: The Future of the Data Frontend with Ryan Dolley of GoodData

    This week on The Data Stack Show, Ryan Dolley joins Eric and John to discuss his unique journey from playwriting to leading product strategy in the data industry. The conversation explores the evolution of business intelligence (BI), the growing influence of AI on analytics, and the shifting skill sets required for data professionals. Key topics include the challenges of adapting to rapid technological change, the importance of embracing engineering practices in BI, and the need for continuous learning. Listeners will gain insights into how AI is transforming data roles, why storytelling remains central to analytics, practical advice for thriving in a fast-changing industry, and so much more.  Highlights from this week’s conversation include: Ryan’s Journey: From Playwriting to Data (1:05)Making a Living as a Playwright (3:02)Transitioning to BI: Night School and First Data Jobs (4:12)Storytelling and Data: The Art of BI (6:22)Early BI Work: Data Warehouses and PDF Reports (8:33)Moving from Utilities to Consulting (13:03)Building vs. Implementing: Product Strategy Lessons (16:37)The AI Shift in BI and Analytics (18:41)Automation Anxiety: The Human Side of Data Change (22:16)The Evolving Role of BI Experts (25:18)Adapting to Change: Learning Code and Experimentation (29:34)AI and the Future of Embedded Analytics (33:38)Capturing Intent: The Value of Modern BI Interfaces (37:03)Bridging the Data and Software Engineering Gap (39:13)The Historical Divide: Data vs. Software Engineering (43:06)Organizational Challenges: Where Does BI Belong? (46:05)Reflections on Self-Service BI and Value (48:46)If Not Data: Ryan’s Alternate Career Paths (49:04)Final Thoughts and Takeaways (50:17)The Data Stack Show is a weekly podcast powered by RudderStack, customer data infrastructure that enables you to deliver real-time customer event data everywhere it’s needed to power smarter decisions and better customer experiences. Each week, we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data. RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com. Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.

    51 min
  4. 261: Will AI Permanently Disrupt the Bundling and Unbundling Cycle?

    10 SEPT

    261: Will AI Permanently Disrupt the Bundling and Unbundling Cycle?

    This week on The Data Stack Show, Eric Dodds and John Wessel explore how AI is reshaping the data industry, focusing on the ongoing cycles of bundling and unbundling within data infrastructure. They discuss the potential for closed ecosystems like Notion to deliver personalized, integrated experiences and examine recent industry moves such as Fivetran’s acquisitions. The conversation also highlights the challenges faced by both startups and incumbents, the influence of enterprise customers on product development, and the enduring importance of trade-offs when choosing between bundled and unbundled solutions. Key takeaways include the complexity of implementing AI across platforms, the likelihood that market cycles will persist despite technological advances, and the need for organizations to carefully weigh integration, flexibility, and long-term risk when adopting new data tools. Highlights from this week’s conversation include: AI’s Value and Early Ecosystem Integration (1:11)Closed Ecosystems and AI Opportunities (3:21)Personalized Software and the Blank Page Problem (6:17)Transition to Data Industry: Bundling Trends (9:56)Market Cycles and AI’s Role in Bundling (12:56)Incumbents, Innovation, and AI Layering (15:53Longevity of Legacy Systems and Ecosystem Risks (17:56)Switching Costs and Incumbent Advantages (20:33)People Dynamics and the Startup-to-Incumbent Arc (22:50)Enterprise Data Infrastructure: Engineering Challenges (26:33)Fragmentation, Bundling Value, and AI’s Insulation Effect (29:54)Too Many Tools: The Real Meaning Behind Bundling Demand (31:36)Trade-offs in Bundling, Unbundling, and AI (33:40)Final Thoughts and Takeaways (34:34)The Data Stack Show is a weekly podcast powered by RudderStack, customer data infrastructure that enables you to deliver real-time customer event data everywhere it’s needed to power smarter decisions and better customer experiences. Each week, we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data. RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

    35 min
  5. 260: Return of the Dodds: APIs, Automation, and the Art of Data Team Survival

    3 SEPT

    260: Return of the Dodds: APIs, Automation, and the Art of Data Team Survival

    This week on The Data Stack Show, the crew welcomes Eric Dodds back to the show as they dive into the realities of integrating AI and large language models into data team workflows. Eric, Matt and John discuss the promise and pitfalls of AI-driven automation, the persistent challenges of working with APIs, and the evolution from big data tools to AI-powered solutions. The conversation also highlights the risks of over-reliance on single experts, the critical importance of documentation and context, and the gap between AI marketing hype and practical implementation. Key takeaways for listeners include the necessity of strong data fundamentals, the hidden costs and risks of AI adoption, the importance of balancing efficiency gains with long-term team resilience, and so much more. Highlights from this week’s conversation include: Eric is Back from Europe (0:37)AI and Data: Jurisdiction and Comfort Level (4:00)APIs, Tool Calls, and Practical AI Limitations (5:08)Scaling, Big Data, and AI’s Current Constraints (9:16)Stakeholder-Facing AI and Data Team Risks (13:20)Self-Service Analytics and AI’s Real Impact (16:04)AI Hype vs. Reality and Uneven Impact (20:27)Cost, Context, and AI’s Practical Barriers (25:25)AI for Admin Tasks and Business Logic Complexity (29:13)Tribal Knowledge, Documentation, and Context Engineering (32:07)AI as a Productivity Accelerator and the “Gary Problem” (35:10)Healthy Conflict, Team Dynamics, and AI’s Limits (39:15)Back to Fundamentals: Good Practices Enable AI (41:47)Lightning Round: Favorite AI Tools and Workflow Integration (45:56)AI in Everyday Life and Closing Thoughts (48:14)The Data Stack Show is a weekly podcast powered by RudderStack, customer data infrastructure that enables you to deliver real-time customer event data everywhere it’s needed to power smarter decisions and better customer experiences. Each week, we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data. RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com. Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.

    51 min

About

Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

You Might Also Like