Data Engineering Central Podcast

Data Engineering in Real Life

Long Live the Data Engineer. No holds barred. Talking about Data Engineering news, topics, and general mayhem. dataengineeringcentral.substack.com

  1. 3 DAYS AGO

    He Quit Apple After 13 Years

    In this episode of Data Engineering Central, I sit down with Kevin, who spent 13 years working at Apple before walking away at the end of 2025. * Not to jump to another job. * Not to start a company. * But to take a step back from everything. Kevin shares his full journey—from growing up in the suburbs of Atlanta to building a career at Apple, and ultimately reaching the point where he could walk away financially and mentally. You can follow along with Kevin below. We dive deep into what it’s really like working in tech: the high salaries, the lifestyle creep, the pressure, and the surprising reality that even people making great money often have no clear financial plan. This conversation also explores the rise of FIRE (Financial Independence, Retire Early), how Kevin discovered it through Mr. Money Mustache, and why his perspective on it has changed over time. Thanks for reading Data Engineering Central! This post is public so feel free to share it. What starts as a path to freedom can easily turn into a scarcity mindset—and that’s something most people don’t talk about. We also get into: * Why high income does not equal financial freedom * The hidden trap of lifestyle inflation in tech * The simple investing strategy that actually works (and why most people ignore it) * Why many engineers are “close” to freedom—but never pull the trigger * The psychology of money, status, and why people stay stuck * How a failed project and burnout became a turning point * And how Kevin went from overworked and unhealthy… to climbing mountains and preparing to backpack 1,000 miles This is not your typical “get rich quick” or “retire at 30” conversation. It’s a grounded, honest look at money, work, and what it actually takes to build a life you don’t need to escape from. If you work in tech, think about FIRE, or just feel like you’re stuck on the treadmill, this one will hit home. Thanks for reading Data Engineering Central! This post is public so feel free to share it. This is a public episode. If you'd like to discuss this with other subscribers or get access to bonus episodes, visit dataengineeringcentral.substack.com/subscribe

    52 min
  2. 24 MAR

    Spark, AI, and the Future of Data Engineering with Daniel Aronovich

    In this episode of Data Engineering Central, I sit down with the founder of DataFlint, Daniel Aronovich, to talk about the realities of working with Apache Spark, distributed data systems, and the future of data engineering. We start with his early journey into tech—how he first discovered large-scale data systems and the lessons he learned from working with real-world Spark workloads. * The conversation then turns toward the future of data engineering, particularly the growing role of AI in software development and data infrastructure. We discuss why generic AI coding assistants often struggle with complex distributed systems, whether AI will eventually be able to automatically optimize data pipelines, and how the role of the data engineer may evolve in the coming years. We covered a lot of career advice for new and upcoming data professionals. We also discuss the origin of DataFlint, a tool designed to help engineers better understand and optimize Spark workloads by analyzing execution plans, logs, and runtime context. If you work with Spark, large-scale data pipelines, or modern data platforms, this conversation will give you a deeper look into how the data engineering landscape is evolving. Thanks for reading Data Engineering Central! This post is public so feel free to share it. This is a public episode. If you'd like to discuss this with other subscribers or get access to bonus episodes, visit dataengineeringcentral.substack.com/subscribe

    47 min
  3. 18 MAR

    DuckDB, AI, and the Future of Data Engineering

    In this episode, I sit down with Matt Martin, Staff Engineer, data architect, ETL practitioner, and author of a new book on DuckDB coming soon, to talk about the past, present, and future of data engineering. Matt has spent decades building and architecting data platforms across technologies such as SQL Server, Oracle, DB2, Hadoop, Redshift, and BigQuery, and now focuses on modern tools such as DuckDB and single-node analytics. We discuss how the data industry has evolved, what actually makes data platforms succeed, and where tools like DuckDB, Polars, Databricks, and Snowflake fit into the future of analytics. We also dive into the impact of AI on coding and data engineering, and whether distributed compute clusters will remain dominant — or if more workloads will move toward high-performance single-node systems. Topics Covered * Matt’s early career and journey into data engineering * The evolution of data warehousing and ETL frameworks * Traditional enterprise data systems vs modern cloud platforms * DuckDB and the rise of single-node analytics * Polars vs DuckDB: where each tool shines * Databricks vs Snowflake * AI-assisted coding and its impact on engineers * The current data engineering job market * Lessons learned from decades of building data systems * Writing a book on DuckDB This is a public episode. If you'd like to discuss this with other subscribers or get access to bonus episodes, visit dataengineeringcentral.substack.com/subscribe

    1 hr
  4. Spark, Lakehouse & AI: A Deep Conversation with Bart Konieczny

    25 FEB

    Spark, Lakehouse & AI: A Deep Conversation with Bart Konieczny

    In this episode of Data Engineering Central, I sit down with Bart Konieczny — data engineer, distributed systems expert, and well-known author in the Data and Spark ecosystem — for a deep technical conversation about modern data engineering. We cover: * How Bart got into tech and distributed systems * His journey through different engineering roles * Spark internals and why they still matter * The realities of lakehouse architecture * Streaming vs batch systems * AI’s impact on data engineering * What engineers should focus on in 2026 In a world obsessed with abstractions and AI tooling, we explore whether understanding the internals is still worth it — or if the game has fundamentally changed. If you’re a data engineer, architect, or platform leader trying to navigate the next phase of the lakehouse era, this one’s for you. Thanks for reading Data Engineering Central! This post is public so feel free to share it. — 🎙️ Data Engineering Central PodcastHosted by Daniel Beach If you’re a CTO or data leader looking for help building or optimizing your data platform, reach out — consulting inquiries welcome. Data Engineering Central is a reader-supported publication. To receive new posts and support my work, consider becoming a free or paid subscriber. This is a public episode. If you'd like to discuss this with other subscribers or get access to bonus episodes, visit dataengineeringcentral.substack.com/subscribe

    45 min
  5. 18 FEB

    DevOps vs ClickOps with Maxine Meurer

    In this episode of the Data Engineering Central Podcast, I sit down with Maxine Meurer, DevOps engineer, author, and educator behind I Love DevOps, for a wide-ranging conversation about careers, infrastructure, automation, and what it actually means to build systems that last. This isn’t a buzzword-heavy DevOps chat. It’s a grounded, honest discussion between two engineers about how people really get into tech, how careers evolve over time, and why modern infrastructure is as much about systems thinking and human judgment as it is about tools. We talk through Maxine’s journey from early technical curiosity to hands-on DevOps work, dealing with “ClickOps” to automation-first infrastructure, and how writing and teaching reshaped the way she thinks about engineering. What we cover in this episode: * 🛠️ From ClickOps to DevOps — what that transition actually looks like in the real world * 🧠 Why DevOps is fundamentally about systems and people, not just pipelines and YAML * 📚 How Maxine went from self-teaching to authoring practical guides like LLMs for Humans and The DevOps Career Switch Blueprint * 🤯 Common mistakes engineers make when learning DevOps, cloud, and distributed systems * 🔍 Testing failures, production realities, and where modern infrastructure still breaks down * 🤖 What AI and LLMs actually change for engineers, and what’s mostly hype * 🧭 Career advice for engineers without a traditional background * 🔮 Where DevOps and platform engineering are heading over the next 3–5 years Throughout the conversation, Maxine brings a refreshing, human-centered perspective to topics that are often over-abstracted or oversold. We dig into the tradeoffs behind tooling choices, the reality of production systems, and the importance of learning how to think, not just what to deploy. If you’re navigating a DevOps or infrastructure career, wrestling with modern stacks, or trying to make sense of AI’s role in engineering, this episode offers clarity, context, and hard-won insight. Learn more about Maxine’s work: * Writing & guides: * LinkedIn: https://www.linkedin.com/in/maxinemeurer/ * Gumroad resources: https://mameurer.gumroad.com Thanks for reading Data Engineering Central! This post is public so feel free to share it. This is a public episode. If you'd like to discuss this with other subscribers or get access to bonus episodes, visit dataengineeringcentral.substack.com/subscribe

    41 min
  6. The Evolution of Software, Streaming, and Data Engineering with Robin Moffatt

    9 FEB

    The Evolution of Software, Streaming, and Data Engineering with Robin Moffatt

    In this episode, I sit down with industry veteran Robin Moffatt — Sr. Principal Advisor in Streaming Data Technologies (Kafka, etc.) and a longtime voice in the data engineering community, to unpack the journey from old-school data architectures to today’s real-time streaming ecosystems. From early mainframe data processing and COBOL through the rise of Apache Kafka, streaming ETL, and event-driven systems, Robin shares lived experience from across decades of building, scaling, and evolving data platforms. We dive into: * 🧠 How the role of software engineering has shifted with the rise of distributed, real-time systems * 📊 Why event streaming and platforms like Kafka aren’t just messaging systems, but the backbone of modern data architectures * 🚀 How the community’s tooling and mental models have had to evolve — from static databases and nightly jobs to continuous, always-on streaming applications * 🤖 A candid look at how AI and real-time data are intersecting, shaping both tooling and expectations for the next decade * 🔮 Robin’s perspective on where the industry is headed — beyond buzzwords toward real engineering maturity Along the way, we get historical context, real-world lessons from conference stages and community forums, and a perspective on building resilient, scalable systems that power today’s data-rich applications. If you’ve ever wondered how we got from batch jobs to continuous event streams, or what it really takes to build modern pipelines that support AI workflows, this conversation with Robin is a must-listen. For more from Robin: * 📍 His personal blog & talks: https://rmoff.net/ * 🔗 LinkedIn profile: https://www.linkedin.com/in/robinmoffatt Thanks for reading Data Engineering Central! This post is public so feel free to share it. This is a public episode. If you'd like to discuss this with other subscribers or get access to bonus episodes, visit dataengineeringcentral.substack.com/subscribe

    50 min

About

Long Live the Data Engineer. No holds barred. Talking about Data Engineering news, topics, and general mayhem. dataengineeringcentral.substack.com

You Might Also Like