In Ep 2 we ask: "Panic or Progress? Reading Between the Lines of AI Safety Tests." We unpack the recent Claude Opus 4 "blackmail" test result, OpenAI's new transparency pledge, and why safety evaluations sometimes sound scarier than they are. Listeners will leave with a clear framework for interpreting headline-grabbing safety reports—and practical advice on when to worry, when to wait, and how to separate red flags from red herrings.
Information
- Show
- FrequencyUpdated Semiweekly
- PublishedJune 26, 2025 at 12:09 AM UTC
- Length1h 16m
- Season1
- Episode2
- RatingClean