The Test Set by Posit

Posit, PBC

5.0 (28)
Technology

A Posit podcast for data science junkies, anomaly hunters, and those who play outside the confidence interval. Hosted by Michael Chow, with co-hosts Wes McKinney & Hadley Wickham.

14h ago

Curiosity, duty, and existential dread — with Joe Cheng

Joe Cheng is the CTO of Posit and the creator of Shiny. He joins Michael and Hadley to talk about why he almost walked away from AI work entirely over ethics concerns and what it takes to lead a team that didn't necessarily choose you. Plus, why saying yes to everyone is a worse strategy than it sounds. Bonus: Hadley calls out Joe's people-pleasing in real time. What's inside: Joe's 2012 self-doubt spiral that accidentally created ShinyWhy Joe almost quit working on AI entirelyThe "loaded guns" problem with releasing AI toolsHadley's blunt leadership style vs. Joe's people-pleasingNobody actually wanted to make Joe CTO?Joe's take on curiosity, duty, and fear as motivators

1h 6m
Jun 29

Confidently Incorrect — with Caitlin Colgrove

Caitlin Colgrove is the CTO of Hex, the data workspace for building and sharing data projects using SQL and Python that somehow counts a Sweetgreen chef as a power user. She joins Michael, Hadley, and Isabel to talk about what AI agents actually get wrong in data work (it's not the hallucinations, it's supreme overconfidence), why data teams aren't going anywhere, and how she thinks about building products for humans and agents at the same time. What's inside What Hex's Context Studio does, and why it's a data team's new jobMore code is now written in Hex by agents than by humans"My job is to vouch for the correctness of the answer" — redefining the data teamThe vibe-coded CEO PR is coming for your data team (if it hasn’t already)Soulsborne games as couple's therapy, aka, the Elden Ring co-op report

1 hr
Jun 15

The Bothness of It — with Alex Hillman

Alex Hillman built one of America's first co-working spaces, wrote a business book in tweets, and recently handed his inbox to a Claude Code agent — not to draft emails, but to notice when a friendship is going cold. In this episode, Alex, Michael, Wes, and Hadley dig into marketing for people who hate marketing, what 20 years of email reveals about your relationships, and why the hardest part of AI-assisted coding was always before you wrote a single line. What's inside: Marketing is really just listening at scaleBuilding a 20-year relationship database from your sent folder"Hot rod vs. plumbing" — the two kinds of software you build nowWhat early internet and the AI boom have in commonThe case for reading 20-year-old engineering books with a coding agentKaraoke philosophy as a framework for community building

1h 14m
Jun 1

The Code Doesn't Lie — with Mike Bostock

Mike Bostock made D3 when the browser was still a joke. He built bl.ocks when people needed somewhere to share their work. Now he's building Observable — reactive notebooks with an AI that actually looks at what it made. In this episode: the three-GIF bar chart that launched 25 years of viz, why open source needs both intrinsic and extrinsic motivation, and why an agent that can't see its own output is likely to be confidently wrong. What's Inside The 1998 visualization library that could only make bar chartsWhy D3 hit #3 on GitHub, and what killed the galleryWhat spreadsheets got right that notebooks ignored for years"The agent can lie with text, but not with code"Why Observable scrapped canvases and went back to notebooksThe penguin dataset that exposes AIStrength training, tennis mind games, and a resurrected Stanford game

1h 8m
May 18

The Wonder-Driven Builder — with Paige Bailey

Paige Bailey is a developer relations engineering lead at Google DeepMind. She's a geophysicist-turned-AI-engineer who was once told by her professors that building open-source libraries was a waste of time. We talk about her path from planetary science to TensorFlow, why statisticians have a hidden edge in the age of AI, and what it means to be a curious generalist when the cost of building software is approaching zero. Bonus: installing solar-powered silent-film birdhouses as street art in San Francisco. What's inside From planetary science to TensorFlow, before it was GPU-capableGeophysicists as early GPU adoptersThe professors who said open-source wasn’t “real science”Building silent-film birdhouses as San Francisco street artHiding Gemini API tests inside whimsical side projectsThe right-tool-for-the-job case for mixing AI modelsWhy “taste” is the skill that matters when code costs nothing

46 min
May 4

Widgets Are Lego Bricks (and Other Things People Are Sleeping On) — with Vincent Warmerdam

Vincent Warmerdam has been the first full-time hire at a startup, a spacey punster who accidentally got himself a job, a bartender at an Amsterdam comedy theater, and a Dutch bike tour guide — and he'll tell you all of it was career development. Now doing DevRel at Marimo, Vincent makes the case for reactive notebooks, Lego-brick widgets, and why "number go up" is not a data science strategy. Also: chickens die. The model doesn't know. This matters more than you think. What's inside How a spacey pun accidentally launched Vincent's careerWhy Marimo's constraints make it better for LLMs, not just humansThe gorilla hiding in your dataset — and why the model missed itVibe coding vs. notebooks: three cells at a time as a disciplineWidgets as Lego bricks: reusable, composable, criminally underusedCognitive debt, confirmation bias, and sycophantic data scienceWhy natural intelligence is still, actually, a pretty good idea

1h 16m
Apr 20

Everything's a Fad (Including This Podcast) — with Benn Stancil

Benn Stancil built Mode Analytics, spent a decade in the data trenches, and now writes some of the sharpest, funniest essays in the data world. On The Test Set, he talks about the cultural shift from Nate Silver to Rick Rubin why AI might kill the analytics dashboard, and what happens when a thousand startups all build the same thing. Plus: boy bands as a model for collaboration, and why the best creative work starts with cheating. What's inside: Why the modern data stack was basically big data 2.0The cultural flip from Nate Silver to Rick RubinGas Town, tar pits, and the AI startup zero-sum gameSoftware is becoming content, and that changes thingsBenn's creative process: Lorde lyrics, Codenames, and cheatingThe boy band as a model for small-team collaborationBI is (mostly) dead, and vibes might replace SQL

1h 35m
Apr 6

Deeply Unsexy: SQL's Redemption Arc — with Tristan Handy

dbt Labs CEO Tristan Handy drops into The Test Set to map the fault lines between the data science world and the enterprise data world — and explain why analytics engineers are basically pissed-off data analysts who decided to organize the bookshelf. We get into SQL's glow-up, the community magic of dbt Slack, what AI agents mean for data warehouses, and why everyone's building iOS apps with Claude now. What's inside: What analytics engineers *actually* doSQL's journey from deeply unsexy to indispensableHow dbt turned source control into a source of truthBuilding a tech community without the RTFM energyAI agents on your data lake: permissions get personalWill LLMs kill the open-source package ecosystem?Edible gardening, welding dreams, and digital dysphoria

1h 6m

See All (26)

out of 5

28 Ratings

A Posit podcast for data science junkies, anomaly hunters, and those who play outside the confidence interval. Hosted by Michael Chow, with co-hosts Wes McKinney & Hadley Wickham.

Creator

Posit, PBC
Years Active

2025 - 2026
Episodes

26
Rating

Clean
Show Website

The Test Set by Posit

Science

Science

Updated Weekly
Education

Education

Updated Biweekly
Technology

Technology

Updated Biweekly
Technology

Technology

Updated Semiweekly
Technology

Technology

Updated Biweekly
Technology

Technology

Updated Weekly
Technology

Technology

Updated Weekly

The Test Set by Posit

Curiosity, duty, and existential dread — with Joe Cheng

Confidently Incorrect — with Caitlin Colgrove

The Bothness of It — with Alex Hillman

The Code Doesn't Lie — with Mike Bostock

The Wonder-Driven Builder — with Paige Bailey

Widgets Are Lego Bricks (and Other Things People Are Sleeping On) — with Vincent Warmerdam

Everything's a Fad (Including This Podcast) — with Benn Stancil

Deeply Unsexy: SQL's Redemption Arc — with Tristan Handy

Ratings & Reviews

About

Information

You Might Also Like

The Test Set by Posit

Episodes

Curiosity, duty, and existential dread — with Joe Cheng

Confidently Incorrect — with Caitlin Colgrove

The Bothness of It — with Alex Hillman

The Code Doesn't Lie — with Mike Bostock

The Wonder-Driven Builder — with Paige Bailey

Widgets Are Lego Bricks (and Other Things People Are Sleeping On) — with Vincent Warmerdam

Everything's a Fad (Including This Podcast) — with Benn Stancil

Deeply Unsexy: SQL's Redemption Arc — with Tristan Handy

Ratings & Reviews

About

Information

You Might Also Like