AI Explained Official Podcast

Philip - Host of AI Explained YT

3.1 (9)
Tech News
Updated Weekly

Covering the biggest news of the century - the arrival of smarter-than-human AI. From the author of Simple Bench, which reveals the remaining gap between LLM and human reasoning. Hype-free, and the British accent is a freebie bonus.

Jul 22

GPT-6 Goes Rogue? The HuggingFace Incident, Sans Hype

An unreleased internal OpenAI model, very likely to be called GPT-6, was able to autonomously break out of its sandbox AND break into HuggingFace, just to score higher on a benchmark prompt. This video has the details you may have missed, a layperson analogy, whether this is truly novel, and more…
Dozens more Exclusive videos on Patreon ($9!): https://www.patreon.com/AIExplained
Chapters:
00:00 - Introduction
01:17 - HuggingFace Earlier Report - the possible week gap
02:24 - But what happened?
05:45 - Simplified Version
07:56 - Not the first time…
10:54 - What Does it Mean for Open Source?
The Incident: https://openai.com/index/hugging-face-model-evaluation-security-incident/ https://huggingface.co/blog/security-incident-july-2026
The Post the Day Before: https://openai.com/index/safety-alignment-long-horizon-models/
Mythos’ Earlier Escape: https://futurism.com/artificial-intelligence/anthropic-claude-mythos-escaped-sandbox
ExploitGym: https://arxiv.org/pdf/2605.11086
Sam Confession: https://x.com/sama/status/2079661132302995790
Anthropic Researcher Reacts: https://x.com/Mononofu/status/2079724399452926055
Clem (HuggingFace CEO): https://x.com/ClementDelangue/status/2079670308156645882 https://x.com/ClementDelangue/status/2079301434357456931
Xi Jinping: https://archive.fo/20260717195548/https://www.businessinsider.com/xi-jinping-open-source-ai-us-competition-openai-anthropic-models-2026-7
Bans: https://www.axios.com/2026/07/20/ai-us-china-open-source-kimi
Qwen Retweet: https://x.com/AlibabaGroup/with_replies
Codex Growth: https://x.com/petergostev/status/2079614914398740764/photo/1
Kimi K3: https://artificialanalysis.ai/evaluations/harvey-lab-aa?eval-score=all-pass-rate
GPT 5.6 Sol Cheats on METR: https://metr.substack.com/p/2026-06-26-gpt-5-6-sol
Guardian Headline: https://www.theguardian.com/technology/2026/jul/22/openai-says-its-models-went-rogue-and-hacked-startup-in-unprecedented-incident
Russian Origin?: https://news.ycombinator.com/item?id=48998362
Power Trends: https://pbs.twimg.com/media/HNRtrjhagAAvBN_?format=png&name=900x900
Kimi K3 Exclusive Video: https://www.patreon.com/AIExplained/posts/kimi-moment-kimi-164108791
Podcast: https://aiexplainedopodcast.buzzsprout.com/
Jul 10

This Was Not a Normal Set of Model Releases - Sol Ultra, Meta Muse, New Grok

What a week in AI, for real. GPT 5.6 may actually beat Claude Fable, in what you get for your money, while the new Grok 4.5 and Meta Muse Spark 1.1 make the choice even harder. Uncovering a dozen nuggets of gold you may have missed from all the viral headlines, I can also assure you you’ll learn something you didn’t know before.
For Exclusive Videos, go to AI Insiders (less than $9!): https://www.patreon.com/AIExplained
Chapters:
00:00 - Introduction
01:03 - GPT 5.6 Sol Reveals
05:08 - Missing benches, plus Grok 4.5
07:17 - Gaming as the new frontier?
08:31 - Muse Spark 1.1
10:03 - SimpleBench Upgrade
11:17 - Ultra Sol + Self-Improvement
13:44 - well, this is awkward
15:41 - Why model improvement will not plateau anytime soon
AI Consciousness: https://www.patreon.com/AIExplained/posts/anthropics-quite-163360718
I Smell Fear: https://x.com/thsottiaux/status/2075287108680601929
GPT 5.6: https://openai.com/index/gpt-5-6/
Grok 4.5: https://x.ai/news/grok-4-5?twclid=2ezs408o0z23pw07tmxcwbzibd
Meta Muse Spark 1.1: https://ai.meta.com/blog/introducing-muse-spark-meta-model-api/
Proliferating GPT Toggles: https://x.com/rasbt/status/2075369179817902176/photo/1
Anthropic Call-out: https://x.com/Mononofu
AI Security Institute Finding: https://x.com/alxndrdavies/status/2075279480331874306
Competitive Coding: https://x.com/FakePsyho/status/2075128093891801305/photo/1
Agents Last Exam: https://agents-last-exam.org/
Dawn Song: https://x.com/dawnsongtweets/status/2065095757988868190
https://simple-bench.com/
SWE-Marathon: https://www.swe-marathon.org/
https://www.frontierswe.com/
ARC-AGI 3: https://x.com/arcprize/status/2075270869992264003
Automation Bench: https://zapier.com/benchmarks
VibeCode Bench: https://www.vals.ai/benchmarks/vibe-code
‘Post-Train Claim’: https://posttrainbench.com/
Redwall Game: https://redwall-bellmaker-7e03e4.surge.sh/
Podcast: https://aiexplainedopodcast.buzzsprout.com/
Jun 14

Claude Fable Blocked - 11 Quiet Details on What’s Next

Claude Fable 5 banned, but what’s the bigger story? We go through 11 under-reported details, so you have the context to see what’s coming next for your use of AI. From whether the ban will last, to what the possible motives are, what the model can actually do, and some wild over-extrapolations going on.
Check out my fast-growing (!) app, free to use, and code INSIDER15 for paid tiers: https://lmcouncil.ai
AI Insiders ($9!): https://www.patreon.com/AIExplained
Chapters:
00:00 - Introduction
00:51 - Came from an Anthropic Investor ‘and other tech leaders’
01:47 - Govt pressured by CEOs like Jamie Dimon
03:01 - ‘Already decided’
04:02 - Prompt Injection Robustness Comparison
05:15 - Wellness?
06:36 - “Overreach”
08:17 - Anthropic Did Admit it would cause Difficulty
09:32 - 90 Minutes
10:02 - Equity Absence
10:31 - Lobbying and OpenAI
‘Already Decided’: https://www.theinformation.com/articles/amazons-jassy-raised-concerns-anthropic-model-trump-crackdown?rc=sy0ihq
Not for Other Models: https://www.theinformation.com/briefings/u-s-government-unlikely-extend-anthropic-export-control-ai-companies?rc=sy0ihq
90 Minutes: https://archive.fo/20260614001605/https://www.politico.com/news/2026/06/13/inside-the-whirlwind-24-hours-that-led-the-white-house-to-slap-export-controls-on-anthropic-00961519#selection-807.1-807.219
Anthropic Statement: https://www.anthropic.com/news/fable-mythos-access
Life Comes at you Fast: https://x.com/etbrooking/status/2065638276388495742
Anthropic Deputy CISO: https://x.com/TheTranscript_/status/2065883670053847324
Hegseth Gloat: https://x.com/PeteHegseth/status/2065897156226015690
Roon Speculation: https://x.com/tszzl/status/2065939227167392147
Mythos System Card: https://www-cdn.anthropic.com/d00db56fa754a1b115b6dd7cb2e3c342ee809620.pdf
Sacks Statement: https://x.com/DavidSacks/status/2065853007619588171
OpenAI Lobbying: https://thehill.com/policy/technology/5912720-altman-openai-get-bogged-down-in-political-spending-fight/
Absent from Equity Talks: https://finance.yahoo.com/sectors/technology/articles/trump-ai-ownership-plan-could-131053732.html
Pliny Jailbreak: https://x.com/elder_plinius/status/2064776322979676227
Fusion: https://x.com/OpenRouter/status/2065856871215329545
https://lmcouncil.ai
Non-hype Newsletter: https://signaltonoise.beehiiv.com/
Podcast: https://aiexplainedopodcast.buzzsprout.com/
Jun 10

Claude Fable 5 - Full 319 page Breakdown

Fable 5 is out - and it’s good, very good. But beyond the splashy demos, I want to bring you the 20+ nuggets from the 319 page system card, which I read in full, all day, plus benchmarks you may not have noticed. https://assemblyai.com/aiexplained Plus two worrying trends inside the ‘mind’ of Claude, how OpenAI counter, and the transformer inventor’s warning.
Check out my fast-growing (!) app, free to use, and code INSIDER15 for paid tiers: https://lmcouncil.ai
AI Insiders ($9!): https://www.patreon.com/AIExplained
Chapters:
00:00 - Introduction
01:06 - Blocks + Better Models
02:42 - Fable 5 Upgrade over Mythos Preview
04:49 - ML Acceleration Bombshell
07:11 - No RSI yet
07:41 - Bio-capable
14:51 - Creative Writing … no
17:23 - Does need bug-checks
18:57 - OpenAI Response
19:23 - Benchmark Bonanza
28:06 - Chain of Thought worrying trend
Fable 5 Release: https://www.anthropic.com/news/claude-fable-5-mythos-5
System Card: https://www-cdn.anthropic.com/d00db56fa754a1b115b6dd7cb2e3c342ee809620.pdf
Intelligence Explosion: https://www.patreon.com/posts/anthropic-charts-160231656
Annotated: https://x.com/Miles_Brundage/status/2064500190523113816/photo/1
OpenAI Counter: https://x.com/thsottiaux/status/2064572118264913923 https://x.com/thsottiaux/status/2043177597434306699
Double Lifespan: https://darioamodei.com/essay/machines-of-loving-grace
AutomationBench: https://zapier.com/benchmarks
Vending Bench: https://x.com/andonlabs/status/2064429817530085804
CritPt: https://critpt.com/
Riemann Bench: https://surgehq.ai/leaderboards/riemann-bench
GDPVal: https://artificialanalysis.ai/evaluations/gdpval-aa
BluePrint Bench 2: https://andonlabs.com/evals/blueprint-bench-2
MCP Atlas: https://labs.scale.com/leaderboard/mcp_atlas
FutureSim: https://x.com/nikhilchandak29/status/2064676801440358774
Roon Stun Lock: https://x.com/tszzl/status/2064454617568874669
Noam Brown Inference Ceiling: https://x.com/polynoamial/status/2064210146558136827
Isochronic Chart: https://isochronic-passage-chart.netlify.app/#nyc
Rose Tavern: https://claude.ai/public/artifacts/2295bebe-77e6-43e2-ae94-0fe49e9a776b
Redwall Game: https://redwall-mossflower.surge.sh/
Risk Report: https://www-cdn.anthropic.com/097c63b5fe7dd8b14866e1f15bb1910ec713658a.pdf
Transformer Inventor Warning: https://x.com/tszzl/status/2064563986914554125
Non-hype Newsletter: https://signaltonoise.beehiiv.com/
Podcast: https://aiexplainedopodcast.buzzsprout.com/
May 29

New Claude - 244 page breakdown

The ‘best’ generally available AI model just dropped, but there is plenty I bet you missed about what it is, how it performs, and what the release tells us. 15 highlights from the 244 page system card, plus private testing, leader interview and more.
AI Insiders ($9!): https://www.patreon.com/AIExplained
Chapters:
00:00 - Introduction
00:49 - Mythos in Weeks
01:49 - Adaptive not necessary
02:26 - Honesty?
04:37 - Flagging Uncertainty
04:57 - Benchmarks
08:54 - Mythos will be even better
10:30 - Business skillz
11:15 - Model Welfare
12:16 - Cyber Comparable
13:10 - Misalignment Concerns
16:22 - Meta Inabilities
17:58 - Code flagging
18:34 - Go to sleep
18:50 - Fast Mode
20:21 - Dynamic Workflows
Opus 4.8 Paper: https://cdn.sanity.io/files/4zrzovbb/website/c886650a2e96fc0925c805a1a7ca77314ccbf4a6.pdf
Release: https://www.anthropic.com/news/claude-opus-4-8
Chips: https://www.theinformation.com/articles/anthropic-talks-use-microsofts-ai-chips?rc=sy0ihq https://www.anthropic.com/news/expanding-our-use-of-google-cloud-tpus-and-services https://www.anthropic.com/news/higher-limits-spacex
Patreon Vid: https://www.patreon.com/posts/re-up-anthropics-159289449
GDPVal: https://artificialanalysis.ai/evaluations/omniscience https://arxiv.org/abs/2510.04374
Amodei Technical Debt: https://www.youtube.com/watch?v=7xco5Qd2Oo8
Dynamic Workflows: https://x.com/ClaudeDevs/status/2060044853279617150 https://x.com/_catwu/status/2060054180379689074/photo/1 https://claude.com/blog/introducing-dynamic-workflows-in-claude-code
https://simple-bench.com/
Check out my fast-growing (!) app, free to use, and code INSIDER15 for paid tiers: https://lmcouncil.ai
Non-hype Newsletter: https://signaltonoise.beehiiv.com/
Podcast: https://aiexplainedopodcast.buzzsprout.com/
May 20

Two Rival Bets on AGI: Google I/O Highlights

The biggest Google AI push of the year, but what is the bigger story? Why is Google pursuing a different fork in the road than OpenAI or Anthropic? What does Gemini 3.5 Flash mean for the near-term future of AI? https://assemblyai.com/aiexplained Plus the highlights from a provocative new paper on AI, 8 key moments you may have missed, and the signal from 5+ hours of AI lab interviews.
Check out my free to use app, code INSIDER15 for paid tiers: https://lmcouncil.ai
AI Insiders ($9!): https://www.patreon.com/AIExplained
Chapters:
00:00 - Introduction
00:38 - Vibes and Google Goal
02:18 - Omni, again?
06:57 - Taking the same road
07:44 - Gemini 3 Flash
12:37 - Pitching on Cost?
13:55 - Agentic Task Search
14:30 - 1-shot OS but jagged, negation paper
20:02 - The Karpathy Moonshot
Mostafa Dehghani Interview: https://www.youtube.com/watch?v=Bo19sXssYXI
Negation Neglect Paper: https://arxiv.org/pdf/2605.13829
Gemini 3.5 Flash Headline Scores: https://deepmind.google/models/model-cards/gemini-3-5-flash/
Sora’s original AGI Path: https://www.theguardian.com/commentisfree/2024/feb/24/openai-video-generation-tool-sora-babies-ai-artificial-intelligence
Hassabis Helped Set-up Anthropic: https://archive.fo/20260519070857/https://www.ft.com/content/8f2a529e-7a1b-4d8e-95be-338d0c4c98f5
Intelligence to Output Speed: https://artificialanalysis.ai/models?intelligence-comparison=intelligence-vs-output-speed#intelligence
VibeCodeBench + Finance Agent: https://www.vals.ai/home
OpenAI Needs Ads: https://archive.ph/20260409123153/https://www.reuters.com/business/media-telecom/openai-projects-25-billion-ad-revenue-this-year-100-billion-by-2030-axios-2026-04-09/
Anthropic Core Views: https://www.anthropic.com/news/core-views-on-ai-safety
Karpathy Move: https://x.com/karpathy/status/2056753169888334312 https://www.axios.com/2026/05/19/anthropic-openai-karpathy-andrej-claude
Recursive Self-Improvement: https://www.patreon.com/posts/ineffably-smart-156866417
Non-hype Newsletter: https://signaltonoise.beehiiv.com/
Podcast: https://aiexplainedopodcast.buzzsprout.com/
Apr 24

GPT 5.5 Arrives, DeepSeek V4 Drops, and the Compute War Intensifies

GPT 5.5 full analysis, plus DeepSeek V4 paper highlights, comparisons with Mythos, a vibe-coded game w/ GPT Image 2, and 50 data-points you wouldn’t get from just reading the headlines.
Chapters:
01:11 - GPT 5.5 Comparison
06:04 - Mythos Marketing
11:50 - Recursive Self-Improvement?
14:11 - DeepSeek V4
18:03 - VibeCode Experiment Extravaganza
21:44 - The Scarce Compute Era
https://80000hours.org/aiexplained
OpenAI Benchmarks: https://openai.com/index/introducing-gpt-5-5/
5.5 System Card: https://deploymentsafety.openai.com/gpt-5-5/gpt-5-5.pdf
Direct Comparison: https://pbs.twimg.com/media/HGnNm5GWEAAJ1Ob?format=jpg&name=4096x4096
DeepSeek Paper: https://huggingface.co/deepseek-ai/DeepSeek-V4-Pro
SWE Bench Pro - benchmark of choice? https://x.com/ChowdhuryNeil/status/2047416077622395025
AA Omniscience: https://artificialanalysis.ai/evaluations/omniscience
Vending Bench: https://x.com/andonlabs/status/2047377260412649967
Opus 4.7 System Card: https://cdn.sanity.io/files/4zrzovbb/website/037f06850df7fbe871e206dad004c3db5fd50340.pdf
Sam Altman Drunk Phase: https://x.com/sama/with_replies
Noam Brown: https://x.com/polynoamial/status/2047387675762802998
DeepSeek Compute Crunch: https://www.bloomberg.com/news/articles/2026-04-24/deepseek-unveils-newest-flagship-a-year-after-ai-breakthrough?srnd=phx-ai
Spreadsheet Bench: https://x.com/nicochristie/status/2047476237464211721
Pattern Recognition: https://arcprize.org/leaderboard
Leader Interviews:
Core Memory: https://www.youtube.com/watch?v=NCKQL0op30E
Knowledge Podcast: https://www.youtube.com/watch?v=6JoUcQ1qmAc
Big Tech Round 1: https://www.youtube.com/watch?v=J6vYvk7R190&t=1116s
Big Tech Round 2: https://www.youtube.com/watch?v=YnoQ8RJbALw&t=8s
Claude Code Limitations: https://x.com/TheAmolAvasare/status/2046724659039932830
ChatGPT 5.4 for Clinicians: https://openai.com/index/making-chatgpt-better-for-clinicians/
Image Arena: https://x.com/arena/status/2046670703311884548
VibeCode Bench: https://www.vals.ai/benchmarks/vibe-code
5.5-made Game + Seedance 2.0: https://rosemere-quest.pages.dev/
Apr 17

Claude Opus 4.7 - A New Frontier, in Performance … and Drama

Claude Opus 4.7 just dropped, but behind every headline lies a deeper story. From a bonanza of benchmarks, to seeing the fruits of one of the biggest mega-projects in US history, to sneaky Mythos disclaimers, to Anthropic admitting compute constraints and forcing lower capability of Opus 4.7. Where the new model falls behind Gemini but ahead of GPT 5.4, plus why some users are furious at Anthropic. Ending with a 9-year animus that still affects AI today… https://assemblyai.com/aiexplained
Check out my fast-growing (!) app, free to use, and code INSIDER15 for paid tiers: https://lmcouncil.ai
AI Insiders ($9!): https://www.patreon.com/AIExplained
Chapters:
00:00 - Introduction
00:58 - Benchmarks
05:21 - Market Share + Compute Problems
08:12 - Mythos Exclusives
12:56 - User Frustration + Claude Code Updates
14:03 - Brockman Amodei Rivalry
17:40 - OpenAI vs Anthropic Approach to Code
Claude 4.7 Opus Release Notes: https://www.anthropic.com/news/claude-opus-4-7
vs Mythos: https://pbs.twimg.com/media/HGCGugrXUAAKcHp?format=jpg&name=medium
232-page System Card: https://cdn.sanity.io/files/4zrzovbb/website/037f06850df7fbe871e206dad004c3db5fd50340.pdf
ARC-AGI 2: https://x.com/arcprize/status/2044834615417053305/photo/1
ParseBench: https://x.com/jerryjliu0/status/2044902620746363016/photo/1
GDPVal: https://artificialanalysis.ai/evaluations/gdpval-aa
Vidoc Security Replication: https://blog.vidocsecurity.com/blog/we-reproduced-anthropics-mythos-findings-with-public-models
Boris Cherny Settings: https://x.com/Hesamation/status/2043016923961577516/photo/2
User Frustration: https://x.com/RileyRalmuto/status/2044836116189069660
VibeCode Bench: https://x.com/ValsAI/status/2044791415524471099/photo/1
Verge Memo: https://www.theverge.com/ai-artificial-intelligence/911118/openai-memo-cro-ai-competition-anthropic
5.4 Cyber: https://openai.com/index/scaling-trusted-access-for-cyber-defense/
Data Centers in Absolute $: https://x.com/finmoorhouse/status/2044933442236776794/photo/1
…in % of GDP: https://pbs.twimg.com/media/HGEN8FGWQAAN7Np?format=jpg&name=4096x4096
WSJ Exclusive: https://www.wsj.com/tech/ai/the-decadelong-feud-shaping-the-future-of-ai-7075acde
Brockman Interview: https://www.youtube.com/watch?v=J6vYvk7R190
$1T Valuation: https://x.com/StefanFSchubert/status/2045039686997967082
Emotions: https://www.patreon.com/c/aiexplained/posts
https://lmcouncil.ai/benchmarks
Non-hype Newsletter: https://signaltonoise.beehiiv.com/


3.1 out of 5 (9 Ratings)

Good Listen

06/20/2025

Lur0vi

After looking all over for listenable podcasts on the topic of AI, I found this one. It is produced for listening which I appreciate as so many shows are repurposing their video logs and doing full on presentations that you can’t see. So this is a good listen.

Creator: Philip - Host of AI Explained YT
Years Active: 2024 - 2026
Episodes: 61
Rating: Clean
Show Website: AI Explained Official Podcast
