Learning Bayesian Statistics

Alexandre Andorra

Are you a researcher or data scientist / analyst / ninja? Do you want to learn Bayesian inference, stay up to date or simply want to understand what Bayesian inference is? Then this podcast is for you! You'll hear from researchers and practitioners of all fields about how they use Bayesian statistics, and how in turn YOU can apply these methods in your modeling workflow. When I started learning Bayesian methods, I really wished there were a podcast out there that could introduce me to the methods, the projects and the people who make all that possible. So I created "Learning Bayesian Statistics", where you'll get to hear how Bayesian statistics are used to detect black matter in outer space, forecast elections or understand how diseases spread and can ultimately be stopped. But this show is not only about successes -- it's also about failures, because that's how we learn best. So you'll often hear the guests talking about what *didn't* work in their projects, why, and how they overcame these challenges. Because, in the end, we're all lifelong learners! My name is Alex Andorra by the way. By day, I'm a Senior data scientist. By night, I don't (yet) fight crime, but I'm an open-source enthusiast and core contributor to the python packages PyMC and ArviZ. I also love Nutella, but I don't like talking about it – I prefer eating it. So, whether you want to learn Bayesian statistics or hear about the latest libraries, books and applications, this podcast is for you -- just subscribe! You can also support the show and unlock exclusive Bayesian swag on Patreon!

  1. #155 Probabilistic Programming for the Real World, with Andreas Munk

    1D AGO

    #155 Probabilistic Programming for the Real World, with Andreas Munk

    Support & Resources → Support the show on Patreon → Bayesian Modeling Course (first 2 lessons free): Our theme music is « Good Bayesian », by Baba Brinkman (feat MC Lars and Mega Ran). Check out his awesome work Takeaways: Q: Why is bridging deep learning and probabilistic programming so important? A: Deep learning is extraordinarily good at fitting complex functions, but it throws away uncertainty. Probabilistic programming keeps uncertainty explicit throughout. Combining the two – as in inference compilation – lets you get the expressiveness of neural networks while still doing proper Bayesian inference. Q: What is inference compilation and how does it relate to amortized inference? A: Amortized inference is the general idea of training a model upfront so you don't have to run expensive inference from scratch every single time. Inference compilation is a specific form of amortized inference where a neural network is trained to propose good posterior samples for a given probabilistic program – essentially learning to do inference rather than computing it fresh each query. Q: What is PyProb and what problems does it solve? A: PyProb is a probabilistic programming library designed specifically to support amortized inference workflows. It lets you write probabilistic models in Python and then train inference networks on top of them, making methods like inference compilation practical for real-world simulators and scientific models. Q: What are probabilistic surrogate networks and why do they matter? A: A probabilistic surrogate network is a learned approximation of a complex, expensive simulator that preserves uncertainty. Instead of running a costly simulation thousands of times, you train a surrogate that can answer probabilistic queries much faster – crucial for applications like risk modeling where speed and uncertainty quantification both matter. Chapters: 00:00:00 Introduction to Bayesian Inference and Its Barriers 00:03:51 Andreas Munch's Journey into Statistics 00:10:09 Bridging the Gap: Bayesian Inference in Real-World Applications 00:15:56 Deep Learning Meets Probabilistic Programming 00:22:05 Understanding Inference Compilation and Amortized Inference 00:28:14 Exploring PyProb: A Tool for Amortized Inference 00:33:55 Probabilistic Surrogate Networks and Their Applications 00:38:10 Building Surrogate Models for Probabilistic Programming 00:45:44 The Challenge of Bayesian Inference in Enterprises 00:52:57 Communicating Uncertainty to Stakeholders 01:01:09 Democratizing Bayesian Inference with Evara 01:06:27 Insurance Pricing and Latent Variables 01:16:41 Modeling Uncertainty in Predictions 01:20:29 Dynamic Inference and Decision-Making 01:23:17 Updating Models with Actual Data 01:26:11 The Future of Bayesian Sampling in Excel 01:31:54 Navigating Business Challenges and Growth 01:36:40 Exploring Language Models and Their Applications 01:38:35 The Quest for Better Inference Algorithms 01:41:01 Dinner with Great Minds: A Thought Experiment Thank you to my Patrons for making this episode possible!

    1h 54m
  2. Bitesize | "What Would Have Happened?" - Bayesian Synthetic Control Explained

    APR 2 ·  BONUS

    Bitesize | "What Would Have Happened?" - Bayesian Synthetic Control Explained

    Today's clip is from Episode 154 of the podcast, with Thomas Pinder. In this conversation, Thomas Pinder explains how Bayesian methods naturally lend themselves to causal modeling, and why that matters for real-world business decisions. The key insight is that causal questions in industry are rarely black and white: instead of a single treatment effect, you get a full posterior distribution, credible intervals, and the ability to communicate the probability that an effect is positive, which is far more useful to stakeholders than a p-value. Thomas then dives into Bayesian Synthetic Control, a reframing of the classic synthetic control method from a constrained optimization problem into a Bayesian regression problem. Rather than optimizing weights on a simplex, you place a Dirichlet prior on the regression coefficients, which turns out to be not just mathematically elegant but practically richer: you can express prior beliefs about how many control units are informative, set the concentration parameter accordingly, or let a gamma hyperprior on that parameter let the data decide. The result is a more flexible, less fragile counterfactual, implemented cleanly in PyMC or NumPyro. Get the full discussion here Support & Resources → Support the show on Patreon: https://www.patreon.com/c/learnbayesstats → Bayesian Modeling Course (first 2 lessons free): https://topmate.io/alex_andorra/1011122 Our theme music is « Good Bayesian », by Baba Brinkman (feat MC Lars and Mega Ran). Check out his awesome work at https://bababrinkman.com/ !

    5 min
  3. #154 Bayesian Causal Inference at Scale, with Thomas Pinder

    MAR 25

    #154 Bayesian Causal Inference at Scale, with Thomas Pinder

    • Support & get perks! • Bayesian Modeling course (first 2 lessons free) Our theme music is « Good Bayesian », by Baba Brinkman (feat MC Lars and Mega Ran). Check out his awesome work! Takeaways: Q: Why was GPJax created and how does it benefit researchers? A: GPJax was developed to provide a high-performance, flexible framework for Gaussian processes (GPs) within the JAX ecosystem. It allows researchers to move beyond black-box implementations and easily experiment with custom kernels and model structures while leveraging JAX’s automatic differentiation and GPU acceleration. Q: What are the primary advantages of using Gaussian processes for data modeling? A: Gaussian processes are highly effective at modeling complex, nonlinear relationships in data. Unlike many machine learning methods that only provide a point estimate, GPs offer built-in uncertainty quantification, which is essential for understanding the reliability of predictions in research and industry. Q: How does the GPJax and NumPyro integration enhance probabilistic modeling? A: The integration allows users to treat GPJax models as components within a larger NumPyro probabilistic program. This combination enables the use of advanced sampling techniques like NUTS (No-U-Turn Sampler), making it easier to build and fit complex hierarchical models that include Gaussian processes. Q: What are the main challenges when applying Gaussian processes to high-dimensional data? A: High-dimensional data significantly complicates GP modeling due to the curse of dimensionality and the cubic scaling of computational costs. In high dimensions, defining meaningful distance metrics for kernels becomes harder, often requiring specialized techniques like sparse GPs or dimensionality reduction to remain tractable. Full takeaways here! Chapters: 11:40 What is GPJax and how does it simplify Gaussian Process modeling? 15:48 How are Bayesian methods used for experimentation and causal inference in industry? 18:40 How do you implement Bayesian Synthetic Control? 32:17 What is Bayesian Synthetic Difference-in-Differences? 39:44 What are the research applications and supported methods for the GPJax library? 45:47 What are the primary software and computational bottlenecks when scaling Gaussian Processes? 49:02 What are the real-world industrial applications of Gaussian Process models? 54:36 How is Bayesian modeling applied to soccer and sports analytics? 58:43 What is the future development roadmap for the GPJax ecosystem? 01:05:37 What is Impulso and how does it integrate into a Bayesian modeling workflow? 01:13:42 How do you balance Bayesian computational overhead with industrial latency requirements? 01:20:26 Why is there optimism that scalable Bayesian methods for causal inference are now within reach? Thank you to my Patrons for making this episode possible! Links from the show here!

    1h 26m
  4. #153 The Neuroscience of Philanthropy, with Cherian Koshy

    MAR 11

    #153 The Neuroscience of Philanthropy, with Cherian Koshy

    • Support & get perks! • Bayesian Modeling course (first 2 lessons free) Our theme music is « Good Bayesian », by Baba Brinkman (feat MC Lars and Mega Ran). Check out his awesome work ! Takeaways: Q: Is generosity a natural human trait? A: Yes, generosity is hardwired in our brains and is essential for social interaction. Q: Why do people say they care about causes but not act on it? A: There is often a disconnect between stated care for causes and actual action. Understanding the conditions under which generosity aligns with a person's identity is crucial for bridging this gap. Q: How should fundraising efforts be approached? A: Fundraising should primarily focus on belief updating rather than mere persuasion. Q: What are the benefits of being generous? A: Generosity has significant mental and physical health benefits, as the brain's reward systems activate when we give, making us feel good. Q: How do our beliefs relate to our actions? A: Our beliefs about ourselves strongly influence our actions and decisions, including our decision to be generous. Q: Can generosity impact a community? A: Yes, generosity can be a powerful tool for improving community dynamics. Q: How can technology like AI assist institutions with donors? A: AI could help institutions remember donors better, improving the donor-institution relationship. Chapters:00:00 What's the role of Behavioral Science inPhilanthropy 19:57 What is The Neuroscience of Generosity? 24:40 How can we best understand Donor Decision-Making? 32:14 How can we achieve reframe Beliefs and Actions? 35:39 What is the role of Identity in Habit Formation? 38:06 What is the Generosity Gap in Philanthropy? 45:06 How can we reduce Friction in Donation Processes? 48:27 What is the role of AI and Trust in Nonprofits? 52:11 How can we build Predictive Models for Donor Behavior? 55:41 What is the role of Empathy in Sales and Stakeholder Engagement? 01:00:46 How can we best align ideas with Stakeholder Beliefs? 01:02:06 How can we explore Generosity and Memory? Thank you to my Patrons for making this episode possible! Links from the show:Come meet Alex at the Field of Play Conference in Manchester, UK, March 27, 2026! https://www.fieldofplay.co.uk/Bayesian workflow agent skillNeurogiving, The Science of Donor Decision-MakingCherian's websiteCherian's press kitLBS #89 Unlocking the Science of Exercise, Nutrition & Weight Management, with Eric Trexler

    1h 9m
  5. #152 A Bayesian decision theory workflow, with Daniel Saunders

    FEB 26

    #152 A Bayesian decision theory workflow, with Daniel Saunders

    • Support & get perks! • Proudly sponsored by PyMC Labs! • Intro to Bayes and Advanced Regression courses (first 2 lessons free) Our theme music is « Good Bayesian », by Baba Brinkman (feat MC Lars and Mega Ran). Check out his awesome work ! Chapters:00:00 The Importance of Decision-Making in Data Science 06:41 From Philosophy to Bayesian Statistics 14:57 The Role of Soft Skills in Data Science 18:19 Understanding Decision Theory Workflows 22:43 Shifting Focus from Accuracy to Business Value 26:23 Leveraging PyTensor for Optimization 34:27 Applying Optimal Decision-Making in Industry 40:06 Understanding Utility Functions in Regulation 41:35 Introduction to Obeisance Decision Theory Workflow 42:33 Exploring Price Elasticity and Demand 45:54 Optimizing Profit through Bayesian Models 51:12 Risk Aversion and Utility Functions 57:18 Advanced Risk Management Techniques 01:01:08 Practical Applications of Bayesian Decision-Making 01:06:54 Future Directions in Bayesian Inference 01:10:16 The Quest for Better Inference Algorithms 01:15:01 Dinner with a Polymath: Herbert Simon Thank you to my Patrons for making this episode possible! Links from the show:Come meet Alex at the Field of Play Conference in Manchester, UK, March 27, 2026! https://www.fieldofplay.co.uk/A Bayesian decision theory workflowDaniel's website, LinkedIn and GitHubLBS #124 State Space Models & Structural Time Series, with Jesse GrabowskiLBS #123 BART & The Future of Bayesian Tools, with Osvaldo MartinLBS #74 Optimizing NUTS and Developing the ZeroSumNormal Distribution, with Adrian SeyboldtLBS #76 The Past, Present & Future of Stan, with Bob Carpenter

    1h 19m
  6. #151 Diffusion Models in Python, a Live Demo with Jonas Arruda

    FEB 12

    #151 Diffusion Models in Python, a Live Demo with Jonas Arruda

    • Support & get perks! • Proudly sponsored by PyMC Labs! • Intro to Bayes and Advanced Regression courses (first 2 lessons free) Our theme music is « Good Bayesian », by Baba Brinkman (feat MC Lars and Mega Ran). Check out his awesome work ! Chapters: 00:00 Exploring Generative AI and Scientific Modeling 10:27 Understanding Simulation-Based Inference (SBI) and Its Applications 15:59 Diffusion Models in Simulation-Based Inference 19:22 Live Coding Session: Implementing Baseflow for SBI 34:39 Analyzing Results and Diagnostics in Simulation-Based Inference 46:18 Hierarchical Models and Amortized Bayesian Inference 48:14 Understanding Simulation-Based Inference (SBI) and Its Importance 49:14 Diving into Diffusion Models: Basics and Mechanisms 50:38 Forward and Backward Processes in Diffusion Models 53:03 Learning the Score: Training Diffusion Models 54:57 Inference with Diffusion Models: The Reverse Process 57:36 Exploring Variants: Flow Matching and Consistency Models 01:01:43 Benchmarking Different Models for Simulation-Based Inference 01:06:41 Hierarchical Models and Their Applications in Inference 01:14:25 Intervening in the Inference Process: Adding Constraints 01:25:35 Summary of Key Concepts and Future Directions Thank you to my Patrons for making this episode possible! Links from the show: - Come meet Alex at the Field of Play Conference in Manchester, UK, March 27, 2026! - Jonas's Diffusion for SBI Tutorial & Review (Paper & Code) - The BayesFlow Library - Jonas on LinkedIn - Jonas on GitHub - Further reading for more mathematical details: Holderrieth & Erives - 150 Fast Bayesian Deep Learning, with David Rügamer, Emanuel Sommer & Jakob Robnik - 107 Amortized Bayesian Inference with Deep Neural Networks, with Marvin Schmitt

    1h 36m

Trailers

Ratings & Reviews

4.7
out of 5
66 Ratings

About

Are you a researcher or data scientist / analyst / ninja? Do you want to learn Bayesian inference, stay up to date or simply want to understand what Bayesian inference is? Then this podcast is for you! You'll hear from researchers and practitioners of all fields about how they use Bayesian statistics, and how in turn YOU can apply these methods in your modeling workflow. When I started learning Bayesian methods, I really wished there were a podcast out there that could introduce me to the methods, the projects and the people who make all that possible. So I created "Learning Bayesian Statistics", where you'll get to hear how Bayesian statistics are used to detect black matter in outer space, forecast elections or understand how diseases spread and can ultimately be stopped. But this show is not only about successes -- it's also about failures, because that's how we learn best. So you'll often hear the guests talking about what *didn't* work in their projects, why, and how they overcame these challenges. Because, in the end, we're all lifelong learners! My name is Alex Andorra by the way. By day, I'm a Senior data scientist. By night, I don't (yet) fight crime, but I'm an open-source enthusiast and core contributor to the python packages PyMC and ArviZ. I also love Nutella, but I don't like talking about it – I prefer eating it. So, whether you want to learn Bayesian statistics or hear about the latest libraries, books and applications, this podcast is for you -- just subscribe! You can also support the show and unlock exclusive Bayesian swag on Patreon!

You Might Also Like