Deep Dive

Deep Dive

Deep Dive is long-form research on AI, tech, and the global economy. Single host, weekly episodes, 25-35 minutes each. The story behind every headline — built from primary sources and original analysis. Recent topics: • AI deanonymization research • Data center infrastructure economics • Strait of Hormuz geopolitics • Agentic AI security • Frontier model behaviors Find Deep Dive across platforms: 📺 YouTube · @DeepDiveAIShow 📱 TikTok · @notdeepdiveai 📷 Instagram · @notdeepdive 🔗 All links · linktr.ee/notdeepdive Tap follow for new episodes.

  1. Why Only Three Companies Make the Memory That Runs AI

    hace 2 días

    Why Only Three Companies Make the Memory That Runs AI

    Three companies make the memory that every AI chip on Earth depends on. They're sold out years in advance, their margins beat NVIDIA's, and in May 2026 all three bought a stake in their own biggest customer. The supplier became the owner. This episode goes inside one component: HBM, the high-bandwidth memory bolted next to every AI accelerator — and the hardest chip on Earth to make. Two questions the headlines skip: why can only three firms make it, and what happens when the supplier owns the customer. The answer to the first is arithmetic. An HBM chip is twelve to sixteen wafer-thin memory dies stacked vertically, and the package only works if every die works — so yield multiplies instead of averaging. At a 95% per-die yield, a sixteen-high stack collapses to about 44%. A new entrant doesn't need to be good; it needs to be near-perfect — and NVIDIA reportedly relaxed its own HBM4 spec to fit what the three best memory makers on Earth can actually yield. Three firms making essentially all of it, sold out years forward, is an oligopoly — and SK hynix posted a 72% operating margin in early 2026, an all-time high that outshines the company buying its chips. Memory is now 63% of an AI chip's component cost, and roughly 90% of all HBM is made in one country: South Korea. The move that anchors it all: on May 28, 2026, Micron, Samsung, and SK hynix all joined Anthropic's $65 billion round at a $965 billion valuation — the first time the entire memory oligopoly took equity in a single AI lab. Which leaves the real question: a durable moat, or the top of a violent cycle? Memory has always been brutally boom-bust, the bears cluster on a 2027 inflection, and all three crossed a trillion dollars in market value within 21 days of each other. The honest read holds both — a real moat, with a real expiration date. Deep Dive goes beneath the headline on the stories that matter — AI, finance, security, and the systems running underneath. RELATED EPISODES The AI Chip War: Why Everyone's Watching the Wrong Fight — named memory as the binding AI bottleneck; this goes inside it. NVIDIA Just Forecast $91 Billion Without China — memory eating the AI chip's bill of materials, now 63%. How LLM Inference Actually Works — the memory wall, and the wafer-scale SRAM bet that routes around it. Why Wall Street Is Betting Billions on Nuclear Power for AI — the same 'AI needs X' thesis priced into trillion-dollar firms. CHAPTERS 00:00 Three firms, sold out, and they bought their customer 01:04 Why only three: the yield math 05:14 Why China can't just become the fourth 06:21 An oligopoly that's sold out for years 08:11 Margins that outshine NVIDIA 10:18 Memory is 63% of the chip — who pays for it 13:21 The Korea chokehold 15:59 The supplier becomes the owner 17:58 A durable moat, or the top of a cycle? 20:34 Can anyone route around HBM? SOURCES HBM stack-yield is multiplicative — at 95% per die, a 16-high stack ≈ 44%; one bad die kills it (nomadsemi; SemiEngineering) Three firms ≈ 97% of HBM, ~90% in Korea; NVIDIA reportedly relaxed its HBM4 spec to fit yields (AI Frontiers; TrendForce) Samsung HBM3E ~18 months late; dismantled its HBM team in 2019 (Tom's Hardware; SamMobile) SK hynix Q1 2026: 72% operating margin, sold out ~3 years; Micron 75% gross margin (CNBC; Yahoo Finance) HBM = 63% of AI-chip component cost; DDR5 kit ~$80–100 → $350–600; Dec 2024 BIS HBM rule (Epoch AI; BIS/CSIS) Anthropic $65B round at $965B — all three memory makers join, May 28 2026; Intel customer-equity counter-precedent (TechCrunch; Bloomberg) ——— This episode is for educational and informational purposes only and does not constitute financial, investment, or trading advice, nor a recommendation to buy or sell any security. Deep Dive is not a registered investment adviser. All investing involves risk. Consult a licensed financial professional before making investment decisions.

    23 min
  2. Robinhood Let AI Trade Your Money — And Said It's Not Responsible. Can It Actually Do That?

    hace 3 días

    Robinhood Let AI Trade Your Money — And Said It's Not Responsible. Can It Actually Do That?

    On May 27, 2026, Robinhood did something no major U.S. broker had done: it handed autonomous AI agents the keys to live stock trading. Paste one link into ChatGPT or Claude, fund an account, and the AI can buy and sell for you — and you can switch off the part where it asks first. It's rolling out to all 27.4 million customers. Then there's the fine print. Robinhood's terms say it "does not control, supervise, monitor, recommend, or audit" the agents and is "not responsible for losses." Every take so far stops there: you're on your own, be careful. But that's not the real story. The real story is whether that sentence even holds. A brokerage may not be able to waive its duty to supervise — FINRA Rule 3110 — and that rule was written for a human, not a chatbot you brought from outside. By law, you can't quietly delete a securities rule; Robinhood's own customer agreement even concedes its liability limits don't touch your FINRA rights. The disclaimer is a bet, not a shield — and it gets tested the first time an agent blows up a real account. We trace how it actually works (and why prompt injection makes it dangerous), then the part nobody's connecting: every previous time automated finance broke, someone with a legal duty paid. Knight Capital's bug cost the firm about $440 million in 45 minutes — and the firm ate it. The robo-advisors stayed fiduciaries; when Schwab's quietly skimmed, it paid about $187 million back to clients. Robinhood itself paid the largest fine in FINRA history in 2021 — the same year it froze GameStop trading and its CEO testified before Congress. Every time, a party with a duty paid. This time, that duty's been engineered out. The cold-open twist: two weeks before launch, Robinhood's own chief legal officer — a former SEC commissioner — warned at FINRA's conference that third-party AI giving financial advice with no oversight is dangerous, and pitched a safer "walled garden, not scraping Reddit." Then his company shipped the open opposite. We give the strongest case that this is fine (it's self-directed trading; the SEC just blessed "tool, not advice"), plus three dated predictions. The bigger question under all of it: can a regulated middleman hand your money to an autonomous AI — and walk away from the result? Whatever breaks here writes the rules for agentic commerce everywhere. Deep Dive goes beneath the headline on the stories that matter — AI, finance, security, and the systems running underneath. RELATED EPISODES When AI Agents Go to Court — who's liable when the actor is an AI How AI Agents Actually Work — the engine now trading your account Why the Fed Summoned Wall Street's CEOs Over an AI Model — AI meets regulated finance Will AI Replace Software Engineers? What the Data Actually Says — same hand-the-keys question, next domain CHAPTERS 00:00 Disclaimer — none of this is investment advice 00:08 His own lawyer warned against it 00:43 Robinhood hands AI the keys to 27M accounts 01:45 How it actually works (and prompt injection) 04:31 "We're not responsible" — is that even allowed? 06:26 Who paid, every time automation broke before 09:01 Why Robinhood did it: the money 10:34 The case that it's fine 12:34 Three predictions 13:18 The verdict SOURCES Robinhood agentic-trading launch + terms (TechCrunch); Q1 PFOF $623M 10-Q FINRA Rule 3110 + 2026 Oversight Report; Gallagher remarks, FINRA conf (May 2026) SEC tool-not-advice staff letter (Apr 2026) — does NOT address AI agents Precedents: Knight Capital; Flash Crash; Schwab robo Fair Fund; Robinhood FINRA California AB 316; Air Canada ruling (Moffatt) ——— This episode is for educational and informational purposes only and does not constitute financial, investment, or trading advice, nor a recommendation to buy or sell any security. Deep Dive is not a registered investment adviser. All investing involves risk. Consult a licensed financial professional before making investment decisions.

    14 min
  3. Will AI Replace Software Engineers? What the Data Actually Says

    hace 4 días

    Will AI Replace Software Engineers? What the Data Actually Says

    Sixteen senior developers used the best AI coding tools — Cursor, Copilot, Claude — on real tasks at real companies. They finished 19% slower than when they worked without AI. And they were sure the AI had sped them up — by about 20%. The researchers ran the experiment again in February 2026 to settle it, and the experiment itself collapsed: developers refused to do the tasks without the AI. The counterfactual is gone. The experiment to measure if AI helps engineers has lost its control group. Meanwhile, the money is undeniably real. Cursor crossed $2 billion ARR. GitHub Copilot has 4.7 million paying seats. Claude Code reportedly hit $2.5 billion in its first year. Someone is paying, at scale, for something we cannot reliably measure. The wins on bounded tasks are real — Stripe runs 1,300 AI-written merges per week, Rakuten agents tackle problems inside 12.5-million-line codebases. But two-year telemetry across 22,000 developers shows the cost: code churn up 800%, bugs per developer up over 50%, deployments per week DOWN 11.7%. More code, generated faster, shipping slower. The cut is landing entirely on the first rung. Entry-level engineering postings fell 60% from 2022 to 2024. Programmer employment for ages 22 to 25 is down 20%. India's four biggest IT firms added 3,910 net employees over a year — firms that used to hire 10,000 in a single quarter. Amazon's CEO said work that used to take 40 engineers now takes six. Stripe's leadership worries out loud what entry-level looks like in ten years. We're automating the junior work that makes a senior, with no plan for how to make seniors without those years. The optimists have history. Compilers, offshoring, spreadsheets — each was supposed to end jobs, and didn't (there are four times as many accountants now). But three things are new: speed (toy to threat in three years), scope (it replaces ladders, not tasks), and the strangest — we can no longer measure what we're trading. The verdict: no, AI is not replacing software engineers the way the headlines mean. But yes, it's already replacing the on-ramp. The danger was never the robot that codes — it's the missing rung on the ladder. We're sawing it off while flying blind. One bold prediction: before the end of 2026, a big-name company that quietly stopped hiring junior engineers reverses course. The trigger won't be quality — engineering pipelines fail slower than that. It'll be one CFO deciding the math looks wrong, before the damage shows. Sixty percent confidence, not ninety. RELATED EPISODES How AI Agents Actually Work — same bounded-vs-fuzzy split The AI Layoff Gap — macro layoff narrative; here it's the engineering cut When AI Agents Go to Court — liability when an AI PR breaks production The Loop Closed in the Sandbox — AI doing AI's own work, shipped to every engineer CHAPTERS 00:00 19% slower (and they couldn't tell) 00:34 A productivity gain nobody can measure 01:10 The money is real 02:12 The METR study, and why the experiment broke 05:15 Maybe we're measuring the wrong thing 05:46 Why the code is getting worse 08:13 The benchmark scandal 08:55 The cut lands on juniors 12:21 The pipeline time bomb 13:50 What history says — and what's different 15:44 Denmark, and the verdict 16:33 The bold prediction SOURCES METR (Jul 2025 + Feb 2026): 19% slower with AI; experiment broke when devs refused to work without it NBER (Feb 2026): 9-in-10 firms report no measurable AI impact Goldman Sachs (Mar 2026): AI ~zero to US GDP; ~30% gain in narrow uses including software Faros AI (22,000 devs, 2 yr): churn +800%, bugs +50%, deploys -11.7% SWE-bench Verified ~80% → held-out SWE-bench Pro ~46% (contamination) BLS / Stanford / NY Fed: entry postings -60%, ages 22-25 -20%, CS-grad underemployment >40% Cursor ~$2B ARR, Copilot 4.7M seats, Claude Code ~$2.5B ARR (single-source) Org cases: Shopify, Amazon 40→6 (Jassy 2025), Stripe 1,300/wk, Klarna rehire, India big-4 +3,910 Jevons history; Denmark NBER null; Anthropic CEO forecast

    18 min
  4. How Close Are We to Self-Driving Cars in 2026?

    hace 5 días

    How Close Are We to Self-Driving Cars in 2026?

    In March 2026, Waymo crossed half a million paid rides a week with nobody in the driver's seat — tenfold growth in two years. Two months later it pulled its cars from four cities for driving into floods. Both are true, and the gap between them is the whole story: there's no longer one self-driving car industry. There are two, and they share a name and almost nothing else. One quietly became a real business. Waymo has driven 170 million miles with no one in the seat, and the reinsurer Swiss Re found it has 88% fewer property-damage and 92% fewer injury claims than humans. The real tell isn't the safety number, though — it's that the scorecard changed, from how often a safety driver grabs the wheel to paid rides per week, compounding. You can't fake half a million paid driverless rides. But the same month it crossed over, it hit the wall: everywhere it's cleared to drive adds up to about 1,400 square miles — the size of Rhode Island — with no service in a single snowy city. Tesla makes the opposite bet — cameras only, drive anywhere — but it hasn't shipped: a thirty-car Austin pilot, most with a human aboard, and two crashes caused by the teleoperator, not the AI. A third player may be biggest of all: China's Apollo Go hit the same quarter-million-rides-a-week line the same month, on a car that costs a quarter as much. And does any of it make money? Two numbers circulate for a Waymo ride — $1.40 and $43 a mile — both correct: the next trip's cost, versus the whole company divided by its miles, with Google eating about $40 of every one. The real bottleneck isn't red tape — it's transparency: US crash-reporting was relaxed exactly as the fleets scale. A jury hit Tesla with a $243 million Autopilot verdict — the first crack in the carmaker's wall, though it blamed the driver for two-thirds. So how close are we? Both answers are true: closer than the skeptics think, since a real paid driverless service exists and is growing — and further than the believers think, since it only works inside a sunny, mapped box. The robotaxi is no longer a promise. It's a product. It's just a product that, as of this month, still pulls over when it rains. RELATED EPISODES How AI Agents Actually Work — the same 'autonomy we want but can't fully trust' tension; a robotaxi is that tension at 40 mph (nobody in the seat, 70 people on call to advise) The Humanoid Robot Race — the 'read the production filings, not the press releases' move; the robotaxi version is the paid-ride curve + the NHTSA crash filings The AI Layoff Gap — there the narrative ran ahead of the data; self-driving is the mirror image (the leader's data finally caught up, the laggard's narrative still leads by a decade) CHAPTERS 00:00 Two industries that share a name 01:08 Waymo — the one that crossed over 02:16 The safety data even the skeptic trusts 03:21 The wall — where it still breaks 06:14 Tesla's opposite bet 08:59 China's bet — Apollo Go 09:35 Does any of this make money? 12:03 The rules — and the crack the car falls through 13:28 Who pays when it crashes 14:12 The verdict — and the bets SOURCES Waymo — 170.7M rider-only miles, ~500K paid rides/week (March 2026, ~10x in two years) Swiss Re reinsurance study: ~88% fewer property, ~92% fewer injury claims vs humans (+ peer-reviewed analysis) Waymo May 2026: flood pauses (San Antonio, Atlanta), freeway-service pause, NHTSA recall Tesla robotaxi — NHTSA crash disclosures (Austin pilot; two teleoperator-caused crashes) Baidu Apollo Go — ~250K driverless rides/week in China (Oct 2025); ~1/4 of Waymo's vehicle cost Waymo economics — Morgan Stanley ~$1.40/mi marginal vs ~$43/mi fully-loaded; Alphabet Other Bets -$2.1B on $411M (Q1 2026) Cruise (~$9B, GM) + Argo (~$7B, Ford/VW) shutdowns — the AV capital graveyard Benavides v. Tesla — $243M Autopilot verdict (2026); NHTSA crash-reporting relaxation; Goldman insurance projection

    16 min
  5. How Anyone Can Strip the Safety Out of an Open-Source AI Model

    hace 5 días

    How Anyone Can Strip the Safety Out of an Open-Source AI Model

    There's a free tool you can install with a single command. Point it at a downloaded AI model — Llama, Gemma, Qwen — and in about thirty minutes, on a used gaming card, it permanently removes the model's ability to refuse. It's called Heretic, it hit number one on GitHub, and it works because of a quiet discovery: a model's refusal isn't woven all through its mind — it's a single direction in the math, and the safety often runs only a few tokens deep. But the tool isn't the real story. Heretic works on Llama, Gemma, Qwen, and Mistral because those labs shipped the cheap, removable kind of safety — a refusal layer that peels right off. Durable safety provably exists. One major lab filtered the dangerous knowledge out of its model before training, and a research team that wove safety in during pretraining watched it survive ten thousand attempts to strip it, where the bolt-on kind collapses in a few hundred. Most labs simply didn't pay for it. We take the honest counter-case seriously. The "safe" models over-refuse — one benchmark caught the most cautious model rejecting ninety-nine percent of perfectly harmless questions — and most demand is ordinary: privacy, fiction, research. Has a stripped model actually caused real-world harm? Almost none on record; the scary names like WormGPT were a different method entirely. But that empty column is its own kind of warning — abliteration runs on the attacker's own machine, with no call home and nothing to log. Then there's the law. The European Union built the world's most aggressive AI law, deciding danger by a single number: how much computing power went into training them. The small and mid-sized models Heretic targets sit below that line, so no one is ever required to safety-test them — and the person who strips the safety spends a few cents of compute, far too little to ever become the legally responsible owner. Enforcement goes live August 2nd, 2026, against a gap a one-command tool walks straight through. The scandal, if there is one, is quieter than the headlines: the safety on most open models was built to be the kind that comes off, and the law written to catch that is watching the wrong number. The question was never whether open AI can be made safe. It's whether anyone selling it decides to. RELATED EPISODES Claude Mythos: The AI That Breaks Everything — predicted the open-source gating endgame ('it becomes an arms race'); this episode is the empirical confirmation The Brand Survives the Arrests (ShinyHunters) — a removable control plus an industrialized exploit pipeline, the same security shape The Mandate That Couldn't Be Met (Palo Alto CVE) — when the rule polices the wrong thing; the regulatory-gap parallel CHAPTERS 00:00 The one-command tool — and how abliteration works 05:11 Who wants this — and the over-refusal trap 05:57 Does stripping the safety keep the model smart? 08:12 Why a release can't be recalled — and has it caused harm? 09:34 What the labs already know — and gpt-oss 12:42 The law that measures the wrong number 15:37 What a truly safe open model would take 16:37 The verdict SOURCES Heretic — open-source abliteration tool (#1 trending on GitHub, Nov 2025); creator comments via the Financial Times Arditi et al., 'Refusal in Language Models Is Mediated by a Single Direction' (NeurIPS 2024) OpenAI gpt-oss — open-weight release with CBRN-filtered pretraining + adversarial misuse evaluation (Aug 2025) EleutherAI & UK AI Security Institute — 'Deep Ignorance': data-filtered durable safety that survived ~10,000 fine-tuning steps Badllama — safety removed from a Llama model in ~1 minute on a single GPU, for pennies EU AI Act — GPAI compute threshold (10^25 FLOP), open-source exemption, and penalty enforcement live August 2, 2026 OR-Bench (Cui et al.) — over-refusal benchmark; the most cautious model rejected ~99% of benign prompts US NTIA (2024) — found insufficient evidence to restrict open model weights

    18 min
  6. The SpaceX IPO: Why the Biggest IPO in History Loses Money

    26 may

    The SpaceX IPO: Why the Biggest IPO in History Loses Money

    On May 20th, 2026, SpaceX filed to become the largest public offering in history. Bankers are guiding toward roughly $1.75 trillion — about 94 times sales — and the share price on the cover of the filing is left blank. Here's the part that should stop you: last year SpaceX lost $4.9 billion, it carries a $41.3 billion accumulated deficit, and only one of its three businesses actually makes money. We read the actual S-1. Starlink is the engine — $11.4 billion in revenue, $4.4 billion in operating income. The rocket business runs a small operating loss (after $3 billion a year on Starship), and the xAI division lost $6.4 billion on $3.2 billion of revenue. In 2024, SpaceX turned its first profit — $791 million. Then it bought Elon Musk's AI company, xAI, and once the results combine, that profit becomes a $4.9 billion loss. The rocket company is even filing as a software company — and 93% of the $28.5 trillion market it points to is the AI division losing the most. Two things make the price work. First, SpaceX's largest disclosed AI customer is Anthropic, paying $1.25 billion a month to rent computing power — on a contract either side can cancel with 90 days' notice. Second, in the weeks before the filing, the index rules quietly changed: Nasdaq cut its waiting period, and the S&P opened a proposal to waive its profitability requirement for exactly these companies. If adopted, it would force trillions in index-fund money to buy SpaceX regardless of the numbers. Then we value it honestly, in both directions. A research firm's sum-of-the-parts and Aswath Damodaran's independent model both land near $1.2 trillion — leaving a half-trillion-dollar gap to the asking price with no financial justification, only a story. The bull case is real too: Starlink is close to a monopoly, grew revenue about 50% last year at ~40% operating margins, and could scale its $4.4 billion profit toward $18 billion. But the base rate is sobering — about 9 of 10 comparable growth IPOs underperformed the market in their first year. SpaceX is an extraordinary company. The question is whether $1.75 trillion is an extraordinary price. This episode is analysis and opinion, not investment advice. Do your own research. RELATED EPISODES How Anthropic Actually Makes Money — this S-1 is the document that revealed the Anthropic-SpaceX compute lease; the owner-vs-renter depreciation flip How the Statute of Limitations Killed Musk v. Altman — covered the Feb 2026 xAI->SpaceX merger + the 3-way IPO governance comparison The Last Independent: Cerebras — the revenue-vs-valuation IPO pattern this echoes at 60x the scale CHAPTERS 00:00 A $1.75 trillion IPO that loses money 01:02 Three businesses, only one makes money 02:26 What $1.75 trillion is actually paying for 04:08 How the profit became a $4.9 billion loss 05:50 The Anthropic compute deal 06:56 The index rules engineered to force the bid 09:42 What it's worth: the $1.2 trillion floor 11:00 The bull case for Starlink 13:25 Governance and who's actually buying 14:18 What history says happens next 15:26 The verdict and two predictions SOURCES SpaceX Form S-1 (May 20, 2026; CIK 0001181412) — SEC EDGAR; Reuters, Axios, TechCrunch Independent valuations ~$1.2T: Aswath Damodaran (NYU Stern) + research-firm sum-of-the-parts Nasdaq-100 fast-entry change (May 1, 2026); S&P DJI profitability-waiver consultation (closes May 28, 2026) SpaceX–Anthropic compute deal — S-1 ($1.25B/month, 90-day mutual termination) Tesla S&P 500 inclusion (Dec 2020) — forced-buying precedent; IPO first-year base rates (Rivian, Meta) ——— This episode is for educational and informational purposes only and does not constitute financial, investment, or trading advice, nor a recommendation to buy or sell any security. Deep Dive is not a registered investment adviser. All investing involves risk. Consult a licensed financial professional before making investment decisions.

    17 min
  7. Why Wall Street Is Betting Billions on Nuclear Power for AI

    25 may

    Why Wall Street Is Betting Billions on Nuclear Power for AI

    In the spring of 2026, a company filed to go public at a $1.66 billion valuation — with no revenue, a "going concern" warning from its own auditors, and a reactor that exists only as an eight-inch test well. It was the second time it had tried to go public. And in the same six weeks, two other energy companies raised about $3 billion between them, all selling Wall Street the same one-sentence pitch: AI needs power. This episode reads that IPO wave as a capital-markets bet, not an energy explainer. Put the three companies — Fervo (geothermal), X-energy (small nuclear reactors), and Deep Fission (a deep-borehole reactor) — on a single risk ladder, and a pattern appears: the market priced engineering risk and time-to-power before it priced revenue. Fervo, the one with actual revenue, booked about $140,000 of it last year — against a $10 billion valuation. Even the banks sorted the deals by risk before the rest of us did: the blue-chip names took Fervo and X-energy; Deep Fission got the second-tier shops. Then the parts the headlines skip. Where the money actually goes (the durable margin sits upstream, in enriched fuel and a cleared spot on the grid — not the celebrated reactor startups). Whether the demand is even real (it's growing fast, but gas, not nuclear, is the actual near-term bridge, and the grid operators are already trimming their forecasts). And the question a federal ruling decides by the end of June: when a data center forces an expensive grid upgrade, who pays for it — the data center, or you? We close on the history that should worry you, the genuine bull case, and an honest update to a prior call that nuclear-for-AI was "mostly marketing" — early, not wrong, with a carve-out geothermal earned on a real cost curve. The through-line: the market is buying the sentence — AI needs power — not yet the megawatts, because the megawatts mostly aren't here. Plus two dated predictions. RELATED EPISODES The Real Cost of AI: Who's Actually Paying for the Build-Out — the ratepayer/cost-incidence side this episode updates (and the 'nuclear is mostly marketing' call it revisits) How Microsoft Is Restarting Three Mile Island — why restarts, not new builds, add electrons The Last Independent: Cerebras — the revenue-vs-valuation IPO pattern this wave echoes The AI Chip War — the bottleneck progression (logic → packaging → memory → power) this is the next chapter of CHAPTERS 00:00 Disclaimer — analysis, not investment advice 00:08 Cold open — a $1.66B reactor that doesn't exist yet 01:15 Three things to keep in front of you 01:36 Why power became the bottleneck 02:59 Three companies, one bet — the IPO wave 04:19 The risk ladder: producing vs. pre-revenue 06:40 Where the money actually goes (fuel + the grid) 08:03 Is the AI-power demand even real? 10:00 Who actually pays (your bill, and Phoenix) 11:22 The history that should worry you 14:15 The verdict, and two predictions 15:11 Where I land — the story vs. the megawatts SOURCES Deep Fission S-1 (May 20, 2026) — SEC EDGAR (CIK 0001918102); Bloomberg, Axios, WNN X-energy IPO (Apr 23, 2026); Fervo Energy IPO (May 12, 2026) — Fervo PR, Bloomberg, Fortune LBNL/DOE 2024 data-center electricity report (4.4% → 6.7-12% by 2028); IEA Electricity 2025 FERC Docket RM26-4 — large-load interconnection cost allocation (rules ~end June 2026); CSIS NuScale CFPP cancellation (Nov 2023); Vogtle 3&4 overrun; V.C. Summer abandonment Fervo de-risking: Stanford Geothermal Workshop (Feb 2024) — 35% learning rate, $9.4M→$4.8M/well Net Zero Insights / PowerMag — 2025 nuclear-fission equity (~$1.3B, record) ——— This episode is for educational and informational purposes only and does not constitute financial, investment, or trading advice, nor a recommendation to buy or sell any security. Deep Dive is not a registered investment adviser. All investing involves risk. Consult a licensed financial professional before making investment decisions.

    16 min
  8. How Anthropic Actually Makes Money

    24 may

    How Anthropic Actually Makes Money

    Anthropic — the company behind Claude — just posted its first-ever operating profit: a projected $559 million on $10.9 billion of revenue in one quarter, from internal projections reviewed by the Wall Street Journal. It's a real milestone. It's also softer than the headline, for a reason almost nobody is explaining. Every month, Anthropic writes a check for more than a billion dollars to a data-center operation now controlled by Elon Musk — about $15 billion a year, roughly 80% of that supplier's entire revenue, on a contract either side can cancel with 90 days' notice. And that bill was discounted during a spring 2026 ramp — the exact quarter the profit appears. Here's why it's structural, not a one-off. All of tech is arguing about whether hyperscalers like Microsoft and Oracle hide the true cost of their chips by stretching depreciation schedules — Michael Burry argues they're understating it by about $176 billion (his own model). Anthropic has the opposite problem: it rents its chips instead of owning them, so it can't smooth that cost at all. The lease is fixed. That's why one discounted quarter flips it to profit and the next can flip it back. You can't smooth a rent check. We walk down the AI "margin ladder" — chipmaker around 75%, cloud 50–55%, model labs 50–60% gross but operating-negative, apps near 25% — to show why a middle-of-the-ladder company is being valued like it sits at the top. Its last confirmed valuation was $380 billion; reports of $900 billion-plus, and an implied ~$1 trillion on thinly-traded private shares, are headlines, not clearing prices. Then we weigh the bull and the bear, because both are right. Bull: the cost to serve a dollar of revenue fell from 71 to 56 cents in a single quarter, and one analysis has inference margins jumping from 38% to over 70%. Bear: the contracts funding the build-out are only 90 days deep, and Bain estimates AI needs about $2 trillion in annual revenue by 2030 and faces an ~$800 billion shortfall. The bridge between them is one question — do you own your compute, or rent it? RELATED EPISODES Musk v. Altman — the courtroom rival is now Anthropic's compute landlord NVIDIA Just Forecast $91 Billion Without China — NVIDIA's ~75% margin is the top rung of the ladder this episode climbs down The Real Cost of AI: Who's Actually Paying for the Build-Out — the 2026 unit-economics update The Last Independent: Cerebras — the 'your supplier is also your competitor' structure CHAPTERS 00:00 Disclaimer — analysis, not investment advice 00:08 Cold open — the Musk check 02:07 The bill: ~$15B/yr to Musk, and the 90-day exit 04:15 The revenue, and the first profit 05:35 Why the profit is just one quarter (the discount) 06:08 The depreciation fight: Burry vs. the audited anchor 07:37 The inversion — you can't smooth a rent check 08:38 The margin ladder: who keeps the money 09:45 The valuation: $380B confirmed vs. $1T thin air 10:20 Bull vs. bear, and the three break conditions 13:50 The verdict: own your compute, or rent it 14:34 Two things to watch, and the close SOURCES SpaceX S-1 (May 2026) — via Reuters, Axios, TechCrunch Anthropic — Series G announcement (Feb 12, 2026) Wall Street Journal — Q2 projections, via PYMNTS and CNBC Amazon Q4 2024 earnings release / 10-K (Feb 2025) Michael Burry / 'Cassandra Unchained' (Nov 2025) Tanay Jaipuria, citing Bessemer (Sep 2025) SemiAnalysis — 'AI Value Capture' (May 2026) Bain — 6th Annual Global Technology Report (Sep 2025) ——— This episode is for educational and informational purposes only and does not constitute financial, investment, or trading advice, nor a recommendation to buy or sell any security. Deep Dive is not a registered investment adviser. All investing involves risk. Consult a licensed financial professional before making investment decisions.

    16 min

Acerca de

Deep Dive is long-form research on AI, tech, and the global economy. Single host, weekly episodes, 25-35 minutes each. The story behind every headline — built from primary sources and original analysis. Recent topics: • AI deanonymization research • Data center infrastructure economics • Strait of Hormuz geopolitics • Agentic AI security • Frontier model behaviors Find Deep Dive across platforms: 📺 YouTube · @DeepDiveAIShow 📱 TikTok · @notdeepdiveai 📷 Instagram · @notdeepdive 🔗 All links · linktr.ee/notdeepdive Tap follow for new episodes.