26 episodes

Tales at Scale cracks open the world of analytics projects. We’ll be diving into Apache Druid but also hearing from folks in the data ecosystem tackling everything from architecture to open source, from scaling to streaming and everything in between- brought to you by Imply.

Tales at Scale Imply Data

    • Technology
    • 5.0 • 5 Ratings

Tales at Scale cracks open the world of analytics projects. We’ll be diving into Apache Druid but also hearing from folks in the data ecosystem tackling everything from architecture to open source, from scaling to streaming and everything in between- brought to you by Imply.

    Securing the “Crown Jewels”: A Journey through Druid Database Security with Carrell Jackson

    Securing the “Crown Jewels”: A Journey through Druid Database Security with Carrell Jackson

    On this episode, we’re going all in on cybersecurity!  Helping us with what critical aspects of security you need to focus on when building analytics applications is Carrell Jackson, CISO at Imply. We’ll discuss the importance of protecting sensitive data by implementing role-based access control and encryption and hear about best practices for securing a Druid cluster. Listen to learn more about how Imply takes a security-first approach to their product development and stick around to hear where Certified Ethical Hacking fits into how Imply’s security stays ahead of threats.

    • 31 min
    Inside Apache Druid 29.0: Getting up to Speed on Druid’s Performance, Ecosystem, and SQL Compliance with Sergio Ferragut

    Inside Apache Druid 29.0: Getting up to Speed on Druid’s Performance, Ecosystem, and SQL Compliance with Sergio Ferragut

    On this episode, we explore Apache Druid 29.0, focusing on three specific themes: performance, ecosystem, and SQL compliance. Discover new features such as EARLIEST / LATEST support for numerical columns, system fields ingestion, and enhanced array support like UNNEST and JSON_QUERY_ARRAY. In addition, get the full scoop on community-contributed extensions like Spectator Histogram and DDsketch for efficient quantile calculations and long-tailed distribution support. Learn about what’s new with MSQ, what’s up with PIVOT / UNPIVOT, and so much more!

    • 22 min
    A Year in Review: Apache Druid's 2023 Highlights with Peter Marshall

    A Year in Review: Apache Druid's 2023 Highlights with Peter Marshall

    In this special episode of Tales at Scale - this is our final episode of our first season! - Peter Marshall, Director of Developer Relations at Imply joins the show to discuss the highlights of 2023 for Apache Druid. We dive into the significant feature releases and enhancements that have transformed Druid over the past year, including the SQL standardizaion, query from deep storage, experimental window functions, and the growing Druid community. Come for the retrospective, stay for the peek into the future of what’s to come for us and for Druid in 2024. See you all next year!

    • 26 min
    From ANSI SQL Support to Multi-topic Kafka Ingestion: What's New in Apache Druid 28 with Will Xu

    From ANSI SQL Support to Multi-topic Kafka Ingestion: What's New in Apache Druid 28 with Will Xu

    On this episode, we dive into Apache Druid 28. This latest Druid release includes improved ANSI SQL and Apache Calcite support, the addition of window functions as an experimental feature, async queries and query from deep storage going GA, array enhancements, multi-topic Apache Kafka ingestion, and so much more! Will Xu, program manager at Imply returns to give us the full scoop.

    • 25 min
    Druid and Joins Debunked! with Sergio Ferragut and Hellmar Becker

    Druid and Joins Debunked! with Sergio Ferragut and Hellmar Becker

    On this episode, we debunk the myth that Druid can't do joins. Druid doesn't function as a traditional relational database because it was purpose-built for lightning-fast queries on large datasets. However, this doesn't mean Druid is entirely devoid of join capabilities – it simply approaches them differently. Our myth-busting team features returning guests Sergio Ferragut and Hellmar Becker from Imply ready to clarify how Druid handles joins in its own unique way and tackle what Druid is for in the first place. 

    • 15 min
    Scaling with Speed: How Atlassian's Confluence Big Data Platform Team Delivers Customer-Facing Insights with Apache Druid with Gautam Jethwani and Kasirajan Selladurai Selvakumari

    Scaling with Speed: How Atlassian's Confluence Big Data Platform Team Delivers Customer-Facing Insights with Apache Druid with Gautam Jethwani and Kasirajan Selladurai Selvakumari

    On this episode, we explore how Atlassian leverages Apache Druid's capabilities to handle millions of daily events and empower users with intelligent data-driven features. We’re joined by Gautam Jethwani and Kasirajan Selladurai Selvakumari from the Confluence Big Data Platform Team who will talk through how they use Druid to power intelligent features, sub-second query latency, and complex ingestion tasks.

    • 16 min

Customer Reviews

5.0 out of 5
5 Ratings

5 Ratings

Top Podcasts In Technology

No Priors: Artificial Intelligence | Technology | Startups
Conviction | Pod People
Lex Fridman Podcast
Lex Fridman
All-In with Chamath, Jason, Sacks & Friedberg
All-In Podcast, LLC
Acquired
Ben Gilbert and David Rosenthal
Hard Fork
The New York Times
This Week in XR Podcast
Charlie Fink Productions