51 episodes

Hey folks, this podcast features interviews with Computer Science researchers from across the world talking about their latest work. The current focus area of the podcast is data management, so expect research from conferences such as SIGMOD, VLDB, ICDE, and CIDR. 
The show is hosted by Jack Waudby, in each episode researchers are interviewed, highlighting the problem(s) they tackled, solutions they developed, and how their findings can be applied in practice. 
This podcast is for industry practitioners, researchers, and students, and aims to further narrow the gap between research and practice, and to generally make awesome Computer Science research more accessible.
You can support the show through Buy Me a Coffee. A donation of $3 will help us keep making you awesome Computer Science research podcasts. 

Hosted on Acast. See acast.com/privacy for more information.

Disseminate: The Computer Science Research Podcast Jack Waudby

    • Education
    • 5.0 • 4 Ratings

Hey folks, this podcast features interviews with Computer Science researchers from across the world talking about their latest work. The current focus area of the podcast is data management, so expect research from conferences such as SIGMOD, VLDB, ICDE, and CIDR. 
The show is hosted by Jack Waudby, in each episode researchers are interviewed, highlighting the problem(s) they tackled, solutions they developed, and how their findings can be applied in practice. 
This podcast is for industry practitioners, researchers, and students, and aims to further narrow the gap between research and practice, and to generally make awesome Computer Science research more accessible.
You can support the show through Buy Me a Coffee. A donation of $3 will help us keep making you awesome Computer Science research podcasts. 

Hosted on Acast. See acast.com/privacy for more information.

    Pat Helland | Scalable OLTP in the Cloud: What’s the BIG DEAL? | #50

    Pat Helland | Scalable OLTP in the Cloud: What’s the BIG DEAL? | #50

    In this thought-provoking podcast episode, we dive into the world of scalable OLTP (OnLine Transaction Processing) systems with the insightful Pat Helland. As a seasoned expert in the field, Pat shares his insights on the critical role of isolation semantics in the scalability of OLTP systems, emphasizing its significance as the "BIG DEAL." By examining the interface between OLTP databases and applications, particularly through the lens of RCSI (READ COMMITTED SNAPSHOT ISOLATION) SQL databases, Pat talks about the limitations imposed by current database architectures and application patterns on scalability.
    Through a compelling thought experiment, Pat explores the asymptotic limits to scale for OLTP systems, challenging the status quo and envisioning a reimagined approach to building both databases and applications that empowers scalability while adhering to established to RCSI. By shedding light on how today's popular databases and common app patterns may unnecessarily hinder scalability, Pat sparks discussions within the database community, paving the way for new opportunities and advancements in OLTP systems. Join us as we delve into this conversation with Pat Helland, where every insight shared could potentially catalyze significant transformations in the realm of OLTP scalability.
    Papers mentioned during the episode:
    Scalable OLTP in the Cloud: What’s the BIG DEAL?Autonomous ComputingDecoupled TransactionsDon't Get Stuck in the "Con" GameThe Best Place to Build a SubwayBuilding on QuicksandSide effects, front and centerImmutability changes everythingIs Scalable OLTP in the Cloud a solved problem?
    You can find Pat on:
    Twitter/XLinkedInScattered Thoughts on Distributed Systems
    Hosted on Acast. See acast.com/privacy for more information.

    • 1 hr 20 min
    Rui Liu | Towards Resource-adaptive Query Execution in Cloud Native Databases | #49

    Rui Liu | Towards Resource-adaptive Query Execution in Cloud Native Databases | #49

    In this episode, we talk to Rui Liu and explore the transformative potential of Ratchet, a groundbreaking resource-adaptive query execution framework. We delve into the challenges posed by ephemeral resources in modern cloud environments and the innovative solutions offered by Ratchet. Rui guides us through the intricacies of Ratchet's design, highlighting its ability to enable adaptive query suspension and resumption, sophisticated resource arbitration for diverse workloads, and a fine-grained pricing model to navigate fluctuating resource availability. Join us as we uncover the future of cloud-native databases and workloads, and discover how Ratchet is poised to revolutionize the way we harness the power of dynamic cloud resources.
    Links:
    CIDR'24 PaperRui's LinkedIn Rui's Twitter/XRui's HomepageYou can find links to all Rui's work from his Google Scholar profile.


    Hosted on Acast. See acast.com/privacy for more information.

    • 53 min
    Yifei Yang | Predicate Transfer: Efficient Pre-Filtering on Multi-Join Queries | #48

    Yifei Yang | Predicate Transfer: Efficient Pre-Filtering on Multi-Join Queries | #48

    In this episode, Yifei Yang introduces predicate transfer, a revolutionary method for optimizing join performance in databases. Predicate transfer builds on Bloom joins, extending its benefits to multi-table joins. Inspired by Yannakakis's theoretical insights, predicate transfer leverages Bloom filters to achieve significant speed improvements. Yang's evaluation shows an average 3.3× performance boost over Bloom join on the TPC-H benchmark, highlighting the potential of predicate transfer to revolutionize database query optimization. Join us as we explore the transformative impact of predicate transfer on database operations.
    Links:
    CIDR'24 PaperYifei's LinkedInBuy Me A CoffeeListener Survey
    Hosted on Acast. See acast.com/privacy for more information.

    • 47 min
    Vikramank Singh | Panda: Performance Debugging for Databases using LLM Agents | #47

    Vikramank Singh | Panda: Performance Debugging for Databases using LLM Agents | #47

    In this episode, Vikramank Singh introduces the Panda framework, aimed at refining Large Language Models' (LLMs) capability to address database performance issues. Vikramank elaborates on Panda's four components—Grounding, Verification, Affordance, and Feedback—illustrating how they collaborate to contextualize LLM responses and deliver actionable recommendations. By bridging the divide between technical knowledge and practical troubleshooting needs, Panda has the potential to revolutionize database debugging practices, offering a promising avenue for more effective and efficient resolution of performance challenges in database systems. Tune in to learn more!
    Links:
    CIDR'24 PaperVikramank's LinkedIn
    Hosted on Acast. See acast.com/privacy for more information.

    • 1 hr 8 min
    Tamer Eldeeb | Chablis: Fast and General Transactions in Geo-Distributed Systems | #46

    Tamer Eldeeb | Chablis: Fast and General Transactions in Geo-Distributed Systems | #46

    In this episode, Tamer Eldeeb sheds light on the challenges faced by geo-distributed database management systems (DBMSes) in supporting strictly-serializable transactions across multiple regions. He discusses the compromises often made between low-latency regional writes and restricted programming models in existing DBMS solutions. Tamer introduces Chablis, a groundbreaking geo-distributed, multi-versioned transactional key-value store designed to overcome these limitations.
    Chablis offers a general interface accommodating range and point reads, along with writes within multi-step strictly-serializable ACID transactions. Leveraging advancements in low-latency datacenter networks and innovative DBMS designs, Chablis eliminates the need for compromises, ensuring fast read-write transactions with low latency within a single region, while enabling global strictly-serializable lock-free snapshot reads. Join us as we explore the transformative potential of Chablis in revolutionizing the landscape of geo-distributed DBMSes and facilitating seamless transactional operations across distributed environments.

    CIDR'24 Chablis PaperOSDI'23 Chardonnay paperTamer's Linkedin
    Hosted on Acast. See acast.com/privacy for more information.

    • 1 hr 2 min
    Matt Butrovich | Tigger: A Database Proxy That Bounces With User-Bypass | #45

    Matt Butrovich | Tigger: A Database Proxy That Bounces With User-Bypass | #45

    Summary:
    In this episode, we chat to Matt Butrovich about his research on database proxies. We discuss the inefficiencies of traditional database proxies, which operate in user-space, causing overhead due to buffer copying and system calls. Matt introduces "user-bypass" which leverages Linux's eBPF infrastructure to move application logic into kernel-space. Matt then tells us about Tigger, a PostgreSQL-compatible DBMS proxy, showcasing user-bypass benefits. Tune in to hear about the experiments that demonstrate how Tigger can achieve up to a 29% reduction in transaction latencies and a 42% reduction in CPU utilization compared to other widely-used proxies.

    Links: Matt's homepageVLDB'23 paperTigger's Github repo

    Hosted on Acast. See acast.com/privacy for more information.

    • 1 hr 3 min

Customer Reviews

5.0 out of 5
4 Ratings

4 Ratings

Top Podcasts In Education

The Mel Robbins Podcast
Mel Robbins
The Jordan B. Peterson Podcast
Dr. Jordan B. Peterson
Small Doses with Amanda Seales
Urban One Podcast Network
Mick Unplugged
Mick Hunt
TED Talks Daily
TED
The Rich Roll Podcast
Rich Roll

You Might Also Like