Data Engineering Central Podcast

Data Engineering in Real Life

Long Live the Data Engineer. No holds barred. Talking about Data Engineering news, topics, and general mayhem. dataengineeringcentral.substack.com

  1. 1D AGO

    AI Isn’t Replacing Curious Developers

    AI isn’t just changing how we write code. It’s changing what it even means to build software. In this episode of the Data Engineering Central Podcast, I sit down with Neil Roberts — a developer who’s been through every major wave of the web, from BASIC on an Atari to modern TypeScript, and now deep into LLMs and agentic workflows. This is not another surface-level “AI will change everything” conversation. We get into what is actually happening right now, where it works, where it completely breaks, and what developers are getting wrong. * We talk about why front-end and UX matter more than ever in an AI world, why most people misunderstand agents, and what real day-to-day workflows with LLMs actually look like. * There’s also a hard look at who benefits from AI, who falls behind, and whether we are quietly building fragile systems that we don’t fully understand. If you’re a developer trying to figure out where this is all going, this is one of those conversations worth paying attention to. Data Engineering Central is a reader-supported publication. To receive new posts and support my work, consider becoming a free or paid subscriber. Expect to learn: * Why AI is as much a UX problem as it is a backend problem * What “agents” actually mean in practice, not in demos * Where LLM workflows are useful today and where they fail hard * Whether junior developers should be worried or excited * How building apps changes when AI is part of the system * What developers should actually be doing right now to stay relevant Neil also has a podcast, The Skill Tree, on AI and agentic-specific topics. We also get into a bigger question most people are avoiding: * Are we heading toward AI-assisted coding… or AI-orchestrated systems where developers become supervisors? * And maybe more importantly… which side of that shift do you want to be on? Thanks for reading Data Engineering Central! This post is public so feel free to share it. This is a public episode. If you'd like to discuss this with other subscribers or get access to bonus episodes, visit dataengineeringcentral.substack.com/subscribe

    1h 4m
  2. APR 29

    AI Is Changing Data Engineering Fast

    In this episode of the Data Engineering Central Podcast, I sit down with Andreas Kretz to break down what is really happening in the industry right now. We go far beyond surface-level AI hype and talk about how data engineering actually works in the real world, what skills still matter, and where most engineers are wasting time. Andreas shares his full journey from industrial IoT and working at Bosch to building one of the largest data engineering education platforms in the world, training over 2,000 students and reaching more than 100,000 engineers globally. We get into what production data systems actually look like, why most learning paths are broken, and how AI is reshaping the role of the modern data engineer. Thanks for reading Data Engineering Central! This post is public so feel free to share it. * We also dig into the uncomfortable truths. AI can write code, but it cannot replace thinking. Most engineers focus too much on tools and not enough on problem-solving, system design, and communication. That gap is only getting bigger. If you are trying to figure out how to stay relevant in data engineering, or you are just getting started and want to avoid years of wasted effort, this conversation will change how you think about your career. Today’s podcast is sponsored by Estuary. Without them, content like this isn’t possible. The best way to support this Newsletter is to check out what Estuary has to offer and click the links below. Build millisecond-latency, scalable, future-proof data pipelines in minutes. Estuary is the Right-Time Data Platform that integrates all of the systems you use to produce, process, and consume data. Also, providing best-in-class CDC (Change Data Capture). Estuary unifies today’s batch and streaming paradigms so that your systems, current and future, are synchronized around the same datasets, updating in milliseconds. What we cover: * Why most data engineers are learning the wrong things * The shift from coding to problem-solving and system design * How AI is actually changing data engineering workflows * Why courses and tutorials are becoming less effective * The difference between real production systems and “toy projects.” * The future of data engineering jobs and whether AI will replace them * Why fundamentals still matter more than ever One of the biggest takeaways is simple. The tools will keep changing, but the problems stay the same. The engineers who win are those who understand systems, ask better questions, and connect business problems to real solutions. Links: * Learn Data Engineering Academy: https://learndataengineering.com * Andreas Kretz on LinkedIn * Andreas Kretz on YouTube * Sponsor: https://estuary.dev Data Engineering Central is a reader-supported publication. To receive new posts and support my work, consider becoming a free or paid subscriber. This is a public episode. If you'd like to discuss this with other subscribers or get access to bonus episodes, visit dataengineeringcentral.substack.com/subscribe

    57 min
  3. Most Data Teams Are Doing It Wrong

    APR 22

    Most Data Teams Are Doing It Wrong

    Most data teams think they’re building value. In reality, they’ve become ticket queues. In this episode, Chris Gambill explains his storied career in tech and data through the years, dealing with data at Fortune 500 company scale, and breaking out on his own. We cover career growth, what separates senior engineers from true strategic operators, and the biggest mistakes people make early on. We discuss the classic problems that have plagued data teams for decades and why it’s all still a struggle. Today’s podcast is sponsored by Estuary. Without them, content like this isn’t possible. The best way to support this Newsletter is to check out what Estuary has to offer and click the links below. Build millisecond-latency, scalable, future-proof data pipelines in minutes. Estuary is the Right-Time Data Platform that integrates all of the systems you use to produce, process, and consume data. Also, providing best-in-class CDC (Change Data Capture). Estuary unifies today’s batch and streaming paradigms so that your systems, current and future, are synchronized around the same datasets, updating in milliseconds. We also dig into Databricks vs Snowflake, what matters and what doesn’t, and how to think about modern data architecture without falling for marketing hype. * On the AI side, we talk about why most LLMs, in the context of developer lifecycles, have changed how we do data, and also about what human skills cannot be replaced. If you care about leveling up beyond just building pipelines, this one is for you. Thanks for reading Data Engineering Central! This post is public so feel free to share it. This is a public episode. If you'd like to discuss this with other subscribers or get access to bonus episodes, visit dataengineeringcentral.substack.com/subscribe

    59 min
  4. APR 15

    From Industrial Data at BASF to Delta Lake Committer

    In this episode, Robert Pack walks through his journey from engineering and simulation work to building large-scale data systems across 900+ plants at BASF. We break down what those systems actually looked like, including ingestion, modeling, and the realities of batch vs real-time in industrial environments. We also dive into: * AI Workflows for Developers * His work as a committer on Delta Lake * Where lakehouse architecture works and where it falls short * The transition into Developer Relations at Databricks This is a grounded, practical conversation about what actually matters when building data platforms. Today’s podcast is sponsored by Estuary. Without them, content like this isn’t possible. The best way to support this Newsletter is to check out what Estuary has to offer and click the links below. Build millisecond-latency, scalable, future-proof data pipelines in minutes. Estuary is the Right-Time Data Platform that integrates all of the systems you use to produce, process, and consume data. Also, providing best-in-class CDC (Change Data Capture). Estuary unifies today’s batch and streaming paradigms so that your systems, current and future, are synchronized around the same datasets, updating in milliseconds. You can find Robert on LinkedIn and GitHub, below. Thanks for reading Data Engineering Central! This post is public so feel free to share it. Come follow me on YouTube!! This is a public episode. If you'd like to discuss this with other subscribers or get access to bonus episodes, visit dataengineeringcentral.substack.com/subscribe

    48 min
  5. APR 1

    He Quit Apple After 13 Years

    In this episode of Data Engineering Central, I sit down with Kevin, who spent 13 years working at Apple before walking away at the end of 2025. * Not to jump to another job. * Not to start a company. * But to take a step back from everything. Kevin shares his full journey—from growing up in the suburbs of Atlanta to building a career at Apple, and ultimately reaching the point where he could walk away financially and mentally. You can follow along with Kevin below. We dive deep into what it’s really like working in tech: the high salaries, the lifestyle creep, the pressure, and the surprising reality that even people making great money often have no clear financial plan. This conversation also explores the rise of FIRE (Financial Independence, Retire Early), how Kevin discovered it through Mr. Money Mustache, and why his perspective on it has changed over time. Thanks for reading Data Engineering Central! This post is public so feel free to share it. What starts as a path to freedom can easily turn into a scarcity mindset—and that’s something most people don’t talk about. We also get into: * Why high income does not equal financial freedom * The hidden trap of lifestyle inflation in tech * The simple investing strategy that actually works (and why most people ignore it) * Why many engineers are “close” to freedom—but never pull the trigger * The psychology of money, status, and why people stay stuck * How a failed project and burnout became a turning point * And how Kevin went from overworked and unhealthy… to climbing mountains and preparing to backpack 1,000 miles This is not your typical “get rich quick” or “retire at 30” conversation. It’s a grounded, honest look at money, work, and what it actually takes to build a life you don’t need to escape from. If you work in tech, think about FIRE, or just feel like you’re stuck on the treadmill, this one will hit home. Thanks for reading Data Engineering Central! This post is public so feel free to share it. This is a public episode. If you'd like to discuss this with other subscribers or get access to bonus episodes, visit dataengineeringcentral.substack.com/subscribe

    52 min
  6. MAR 24

    Spark, AI, and the Future of Data Engineering with Daniel Aronovich

    In this episode of Data Engineering Central, I sit down with the founder of DataFlint, Daniel Aronovich, to talk about the realities of working with Apache Spark, distributed data systems, and the future of data engineering. We start with his early journey into tech—how he first discovered large-scale data systems and the lessons he learned from working with real-world Spark workloads. * The conversation then turns toward the future of data engineering, particularly the growing role of AI in software development and data infrastructure. We discuss why generic AI coding assistants often struggle with complex distributed systems, whether AI will eventually be able to automatically optimize data pipelines, and how the role of the data engineer may evolve in the coming years. We covered a lot of career advice for new and upcoming data professionals. We also discuss the origin of DataFlint, a tool designed to help engineers better understand and optimize Spark workloads by analyzing execution plans, logs, and runtime context. If you work with Spark, large-scale data pipelines, or modern data platforms, this conversation will give you a deeper look into how the data engineering landscape is evolving. Thanks for reading Data Engineering Central! This post is public so feel free to share it. This is a public episode. If you'd like to discuss this with other subscribers or get access to bonus episodes, visit dataengineeringcentral.substack.com/subscribe

    47 min
  7. MAR 18

    DuckDB, AI, and the Future of Data Engineering

    In this episode, I sit down with Matt Martin, Staff Engineer, data architect, ETL practitioner, and author of a new book on DuckDB coming soon, to talk about the past, present, and future of data engineering. Matt has spent decades building and architecting data platforms across technologies such as SQL Server, Oracle, DB2, Hadoop, Redshift, and BigQuery, and now focuses on modern tools such as DuckDB and single-node analytics. We discuss how the data industry has evolved, what actually makes data platforms succeed, and where tools like DuckDB, Polars, Databricks, and Snowflake fit into the future of analytics. We also dive into the impact of AI on coding and data engineering, and whether distributed compute clusters will remain dominant — or if more workloads will move toward high-performance single-node systems. Topics Covered * Matt’s early career and journey into data engineering * The evolution of data warehousing and ETL frameworks * Traditional enterprise data systems vs modern cloud platforms * DuckDB and the rise of single-node analytics * Polars vs DuckDB: where each tool shines * Databricks vs Snowflake * AI-assisted coding and its impact on engineers * The current data engineering job market * Lessons learned from decades of building data systems * Writing a book on DuckDB This is a public episode. If you'd like to discuss this with other subscribers or get access to bonus episodes, visit dataengineeringcentral.substack.com/subscribe

    1 hr

About

Long Live the Data Engineer. No holds barred. Talking about Data Engineering news, topics, and general mayhem. dataengineeringcentral.substack.com

You Might Also Like