
Database Technology in the Age of AI with DuckDB Labs co-creator Hannes Mühleisen
In this episode of The Data Engineering Show, host Benjamin and co-host Eldad sit down with Hannes Mühleisen, CEO of DuckDB Labs and co-creator of DuckDB.
Together, they:
- Talk about the journey of DuckDB, an open-source analytical database system designed as a universal wrangling tool.
- Explain how DuckDB differs from SQLite, highlighting the analytical and transactional use cases.
- Discuss what sets DuckDB apart and its approach to innovation, including building its own Parquet reader.
- Explore the simple and efficient ecosystem of DuckDB, allowing developers to add custom functionality without changing its core stability.
- Consider Hannes' perspective on the role of AI in databases.
- Delve into the system’s infrastructure, design choices and the dedication of the team to ensure a continuous, reliable database system.
If you enjoyed this episode, make sure to subscribe, rate, and review it on Apple Podcasts, Spotify, and YouTube Podcasts.
Hannes Mühleisen is the CEO of DuckDB Labs and a professor in the Netherlands, known for co-creating DuckDB, an open-source analytical database system. With a background in database architecture and research at the CWI Database Architectures group, he has led the development of DuckDB as a universal data wrangling tool that can run everywhere from phones to space satellites. Under his leadership, DuckDB has reached 10 million downloads per month and become a go-to solution for analytical database needs. His commitment to keeping DuckDB lightweight, portable, and hardware-agnostic while maintaining high performance has changed how developers approach analytical database solutions. As both an academic and a technology leader, Hannes brings unique insights into database architecture, open-source development, and the future of analytical data processing.
Episode Highlights:
- The Purpose of DuckDB (01:04)
- SQLite vs DuckDB (02:53)
- The Importance of Collaboration (08:14)
- The Component-Based Architecture of DuckDB (11:25)
- The Parquet Reader Journey (17:51)
- The Role of AI in Database Interaction (22:41)
- SQL - A Defined Interface (29:20)
- The Golden Age of Database (38:57)
Quotes:
- “DuckDB is a universal data wrangling tool. It is a relational data management system that speaks SQL designed to do well on analytical use cases.”
- “We call ourselves the SQLite for analytics because it explains the original design goal of DuckDB very well.”
- “Within the database engine space, we are all working to solve the same problems, and that's like, a hundred of us on the planet.”
- “It actually turns out in order to make a competent Parquet reader, you do need query execution. There is just no way around it.”
- “I really like this golden age of databases we are in and personally, as somebody who really likes tables and SQL, I'm quite happy to see things like Firebolt and others really working on core engine stuff.”
For Feedback & Discussions on Firebolt Core:
- Join Firebolt Discord Community
- Join Firebolt GitHub Discussions
- Firebolt Core GitHub Repository
- Benjamin@Firebolt.io
Previous guests include: Joseph Machado of LinkedIn, Matthew Weingarten of Disney, Joe Reis and Matt Housley, authors of The Fundamentals of Data Engineering, Zach Wilson of Eczachly Inc, Megan Lieu of Deepnote, Erik Heintare of Bolt, Lior Solomon of Vimeo, Krishna Naidu of Canva, Mike Cohen of Substack, Jens Larsson of Ark, Gunnar Tangring of Klarna, Yoav Shmaria of Similarweb and Xiaoxu Gao of Adyen.
Check out our three most downloaded episodes:
- Zach Wilson on What Makes a Great Data Engineer
- Joe Reis and Matt Housley on The Fundamentals of Data Engineering
- Bill Inmon, The Godfather of Data Warehousing
Information
- Show
- Frequency: Updated every two months
- Published: March 19, 2025, 11:00 AM [UTC]
- Length: 31 minutes
- Episode: 40
- Rating: Suitable for children