50 episodes

Data science, big data, artificial intelligence, machine learning… they’re all the rage. In this podcast, Jessi Cisewski-Kehe and Susan Wang, two statisticians, give you a perspective on what’s happening in the realm of all things data. Random banter included.

Support this podcast: https://podcasters.spotify.com/pod/show/databytes/support

DataBytes Jessi & Susan

    • Science
    • 5.0 • 6 Ratings

    #50: Extreme Classification: All You Need Is Some Hash (Functions)

    In part 2 of this saga on extreme classification, we get into the weeds on how MACH is able to magically handle such massive classification problems. The title says it all -- hash functions are the magical ingredient. We provide a step-by-step view of how one might come up with the MACH algorithm from first principles. 


    ---

    Send in a voice message: https://podcasters.spotify.com/pod/show/databytes/message

    • 21 min
    #49: Extreme Classification: Going at MACH Speed (Part 1)

    In this episode, Dr. Derek Feng drops by to chat about a recent paper on a divide-and-conquer approach to massive classification problems: MACH (Merged-Averaged Classifiers via Hashing). In part 1 of 2, we describe the general problem that MACH solves and the strategy it takes, in which the original large classification problem is broken down into smaller classification problems. Next week, in the second episode, we talk about the more technical details of how the division of labor works, and why it works.
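As a rough illustration of the divide-and-conquer idea described above, the sketch below hashes a large set of class labels into a small number of buckets several times over, then scores each original class by averaging the probability of the bucket it lands in across repetitions. It is a minimal sketch under stated assumptions, not the paper's implementation: the SHA-256-based hash, the sizes (10,000 classes, 32 buckets, 8 repetitions), and all function names are hypothetical, and the trained small classifiers are replaced by an idealized stand-in that puts all probability on the correct bucket.

```python
import hashlib


def bucket_of(label: int, rep: int, num_buckets: int) -> int:
    """Hash a class label into one of num_buckets buckets for repetition rep.

    Assumption: any reasonably uniform hash would do; SHA-256 is used only
    because it is deterministic and in the standard library.
    """
    digest = hashlib.sha256(f"{rep}:{label}".encode()).digest()
    return int.from_bytes(digest[:8], "big") % num_buckets


def ideal_bucket_probs(true_label: int, reps: int, num_buckets: int):
    """Stand-in for the small trained classifiers (hypothetical).

    Each repetition returns a probability vector over buckets that puts
    all of its mass on the bucket containing the true label.
    """
    probs = []
    for r in range(reps):
        row = [0.0] * num_buckets
        row[bucket_of(true_label, r, num_buckets)] = 1.0
        probs.append(row)
    return probs


def merged_average_scores(bucket_probs, num_classes: int, num_buckets: int):
    """Score each original class by averaging its bucket's probability
    over all repetitions."""
    reps = len(bucket_probs)
    return [
        sum(bucket_probs[r][bucket_of(k, r, num_buckets)] for r in range(reps)) / reps
        for k in range(num_classes)
    ]


NUM_CLASSES, NUM_BUCKETS, REPS = 10_000, 32, 8  # illustrative sizes only
true_label = 4242

probs = ideal_bucket_probs(true_label, REPS, NUM_BUCKETS)
scores = merged_average_scores(probs, NUM_CLASSES, NUM_BUCKETS)
predicted = max(range(NUM_CLASSES), key=scores.__getitem__)
print(predicted, scores[predicted])
```

Each small classifier only has to distinguish 32 buckets rather than 10,000 classes. A wrong class can tie with the true class only if it shares a bucket with it in every repetition, which becomes unlikely as the number of repetitions grows.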


    • 16 min
    #48: Where Moneyball Meets Footy

    We've long heard about the waves that statistics has made in baseball. But what about soccer? In this episode, we summarize a few applications of statistics in European football (soccer, in American English).


    • 16 min
    #47: Domoic Acid Testing -- A Crabshoot?

    Domoic acid has plagued shellfish and other wildlife along the Pacific coastline in recent years. Testing for domoic acid concentration in crabs on a regular basis has become important for determining when crabs and their viscera can be safely consumed. Unlike many other common hypothesis tests, the setup used for domoic acid testing is based on the sample maximum rather than the sample mean. In this episode, we critique the testing methodology. 
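One generic property of a rule based on the sample maximum can be sketched numerically: if each sampled crab independently exceeds a threshold with probability p, the chance that the maximum of n samples exceeds it is 1 - (1 - p)^n, which rises with n. This is only an illustration under assumed independence and a made-up value of p. It is not the actual testing protocol, and not necessarily the critique made in the episode.

```python
def prob_max_exceeds(p_single: float, n: int) -> float:
    """Probability that the largest of n independent samples exceeds a
    threshold, when each sample exceeds it with probability p_single."""
    return 1.0 - (1.0 - p_single) ** n


# Hypothetical per-sample exceedance probability, chosen for illustration.
p = 0.05
for n in (1, 6, 12):
    print(n, round(prob_max_exceeds(p, n), 4))
```

A rule based on the sample mean does not behave this way: with more samples, the mean settles around its expected value instead of drifting upward, which is one reason the choice of sample size matters more for a max-based rule.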


    • 18 min
    #46: Finding Your (Niche) Board Games

    In this episode, we discuss how two statisticians used data from BoardGameGeek.com to put together their own board game recommendation engine, specifically designed to stay away from mainstream recommendations.


    • 12 min
    #45: Learning Publicly, with Private Data

    In this episode, Dr. Derek Feng discusses the general issue of data privacy in the age of big data, including topics of differential privacy and federated learning.


    • 16 min

Customer Reviews

5.0 out of 5
6 Ratings

cookie_dokie

Very nice introduction to data science!

I've been listening to this podcast for the last month, and it provides an easy way to learn about what's going on in the world with context to the artificial intelligence & machine learning revolution taking place. Each episode is no more than 30 minutes for an interesting "byte-size" chunk that is easily digestible. Continue the great work!
