CloudCostChefs

CloudCostChefs

CloudCostChefs is the weekly show that turns sky-high cloud bills into bite-size savings. In 10 fast minutes you’ll get no-fluff news, hand-tested optimization “recipes,” and automation hacks that keep workloads lean, fast, and budget-friendly—across AWS, Azure, GCP, OCI, and more. Hosted by cost-obsessed cloud engineers, each episode arms you with actionable tips you can run today plus the tools that make your CFO do a happy dance. Aprons on, cloud-cost warriors—let’s get cooking!

  1. HACE 6 DÍAS

    EP10 - Anthropic's $20 Enterprise Flip and Snowflake's 12x Visibility Tax: The Week Flat-Fee AI Pricing Died

    Two announcements landed seven days apart that ended the compute absorption model AI vendors ran from 2023 through 2025. In Episode 10 of Cloud Cost Chefs, we cover both — and the structural FinOps consequence that every enterprise AI budget owner needs to understand before their next renewal. We cover:- Anthropic's Claude Enterprise flip (April 14, 2026): The up-to-$200/user/month flat-fee model with bundled token allowance is gone. The new structure: $20/user/month for platform access, with Claude, Claude Code, and Cowork usage billed separately at standard API rates. Applies to customers with 150+ users. Legacy plans must migrate at next contract renewal or lose grandfathered pricing. We work the math on a 500-user deployment — from $1.2M/year predictable to a $120K platform fee plus variable token consumption that most FinOps teams have never measured.- The industry-wide metering shift: In 30 days — OpenAI moved Codex from flat-message pricing to token metering and launched a $100 Pro tier. GitHub tightened Copilot limits April 10. Windsurf replaced its credit system with daily and weekly quotas in March. Anthropic's precursor signals (Claude Code prompt cache TTL cut from 1 hour to 5 minutes, peak-hour 5-hour session caps for Pro/Max users hitting ~7% of the user base). The flat-fee SaaS-for-AI era is over — Anthropic was not first, but was the clearest signal.- The renewal trap: Organizations that did not capture per-user token consumption during the bundled period are walking into a variable-cost renegotiation with no baseline. We walk through the three questions a FinOps team needs to answer in the next 30 days: actual per-user token consumption today, which use cases justify the variable cost, and what enforcement mechanism exists for budget overruns.- Snowflake Budgets for AI Features GA (April 10, 2026): A legitimate FinOps capability for AI spend — showback, chargeback, per-team user tag attribution across AI Functions, Cortex Code, Cortex Agents, and Snowflake Intelligence. The release note looks clean. The implementation documentation exposes the catch.- The 12x visibility tax: Snowflake's budget documentation confirms — a budget consumes 1 credit per month at the default 6.5-hour refresh, or 12 credits per month at 1-hour refresh. Real-time governance on AI spend comes with a 12x premium on the governance function itself. And the underlying `CORTEX_AI_FUNCTIONS_USAGE_HISTORY` view has a 60-minute maximum latency — meaning even paying the 12x premium caps the effective governance loop at one hour. For runaway agent workloads burning credits at 10x normal rate, that is still a meaningful blind spot.- The incident economics of AI observability: We work the math on a runaway Cortex Agents workload scenario — when the 12x refresh uplift pays for itself after one incident, and when it doesn't at portfolio scale across 40 workloads with different consumption profiles.- The cross-platform pattern: AWS Bedrock Data Exports (Episode 8), Azure Log Analytics ingestion pricing, GCP BigQuery billing export query costs — every major platform has the same emerging economic structure. AI observability is no longer overhead; it is a metered product line item that competes with the spend it's measuring.- The connecting thesis: Anthropic is passing inference costs directly into the invoice. Snowflake is passing the cost of seeing those costs into the invoice. Episode 9 argued that FinOps had to evolve into a technology value function. Seven days later, April 2026 made the evolution non-optional. The closing question: If your organization went to Claude Enterprise renewal tomorrow, would you have per-user token baseline data to negotiate against? If your Cortex Agents workload starts burning credits at 10x normal rate, how long before the budget catches it — and what does that detection cost per month? That's Episode 10.

    20 min
  2. 10 ABR

    EP9 - FinOps Beyond Cloud: The Technology Value Mandate, the AI Trough, and Why 95% of GenAI Projects Return Zero ROI

    The FinOps Foundation officially changed its mission in March 2026 — from "managing the value of cloud" to "managing the value of technology." That's not a rebranding. It's a mandate expansion that most FinOps organizations aren't staffed or authorized to execute. In Episode 9, we forensic the gap. We cover:- The Framework 2026 mission change: What "technology value" actually means operationally — and why "Executive Strategy Alignment" as a new FinOps capability requires practitioner access to technology selection decisions that 88% of teams don't currently have.- The authority gap by the numbers: Practitioners with VP-level or higher sponsorship are 2–4x more likely to influence technology decisions. 53% with C-suite access can influence cloud service selection; 12% with director-level access can. The mandate says govern technology value. The org chart says report on billing data.- The scope math: 98% of FinOps teams now manage AI costs (up from 31% in 2024). 90% manage SaaS. 64% manage licensing. 48% manage data centers. 28% manage labor costs. Team size: unchanged at 8–10 practitioners. The Foundation's scaling answer is automation and federated enablement — which works for cloud cost ops, and is not obviously sufficient for Executive Strategy Alignment.- Gartner's $2.52 Trillion Trough: Worldwide AI spending hits $2.52 trillion in 2026 (+44%), while Gartner simultaneously places AI in the Trough of Disillusionment. Why these aren't contradictory: discretionary AI projects (where 95% returned zero ROI, per MIT) are different from incumbent vendor AI bundling (Copilot, Einstein AI, ServiceNow AI embedded in renewals enterprises can't easily exit).- The 9% inflation tax: CIOs are allocating 9% of their entire IT budget to price increases on existing software in 2026. GenAI model spending is up 80.8%. Most organizations cannot quantify whether the AI features in their enterprise software contracts are being used, let alone whether they're worth the premium.- The Copilot question nobody can answer: How many organizations can tell you per-department Copilot utilization, cost per AI-assisted task versus manual baseline, and whether the $30/user/month is justified by the specific use cases running at their company? The FinOps team that can answer this is operating as a technology value function. Most cannot.- The Trough has a floor: Unlike previous hype cycles, AI spending is growing 44% while in the Trough. Hyperscaler capex ($600B in 2026), infrastructure commitments, and bundled SaaS renewals don't pause for disillusionment. The organizations building AI ROI measurement infrastructure during the Trough will be positioned to allocate effectively when the Plateau of Productivity arrives in 2027–2028. The closing argument: the FinOps Foundation changed the mission because the cost story broke. When 9% of IT budgets go to price increases on existing software and 95% of AI projects return zero, "cloud cost management" was never going to be enough. The question is whether organizations restructure FinOps to execute the technology value mandate or just rename the function without changing the authority structure. Research sourced from FinOps Foundation Framework 2026, State of FinOps 2026 Report (1,192 respondents, $83B+ in spend), Gartner IT Spending and AI Forecasts (January–February 2026), MIT GenAI ROI study (2025), and S&P Global cloud infrastructure analysis. CloudCostChefs — FinOps knowledge, served fresh. https://cloudcostchefs.com

    21 min
  3. 3 ABR

    EP8 - FinOps Automation in 2026: Why Cloud Waste Rose to 29% — and How to Govern AWS Bedrock AI Costs Before They Compound

    72% of companies exceeded their cloud budget last year. Not because dashboards don't exist, but because manual FinOps can't move fast enough. In Episode 8, we look at why automation is the gap, and why AI just made that gap worse. Here's what we cover: - The automation gap by the numbers: Teams with mature FinOps automation save 25–30% more than manual-only teams. Automated commitment management (Reserved Instances, Savings Plans) delivers 15–35% higher savings rates. The ROI is clear. The adoption isn't. So why? - Cloud waste ticked back up to 29% in 2026: After years of steady decline, it reversed. Why? AI workloads arrived faster than governance did. The tools built for EC2 rightsizing and RI management don't handle inference costs. The waste moved into that gap. - The Flexera/ProsperOps acquisition signal: In January 2026, Flexera bought ProsperOps—an autonomous commitment management platform. What does this tell you about where FinOps automation is headed? And what does it mean for practitioners whose job is doing this work by hand? - The last-mile problem: What happens when you actually automate everything? Mature teams hit a 97% optimization ceiling. Then comes the harder work: forecasting, unit economics, AI cost governance. - AWS Bedrock operation-level CUR (January 2026): AWS Data Exports now breaks out InvokeModelInference and InvokeModelStreamingInference as separate line items. Combined with Application Inference Profiles and cost allocation tags, you can track AI spend by model, team, and use case—if you set it up. Most teams don't. - Bedrock AgentCore: AWS launched a managed runtime for agentic AI workloads this week. Self-invoking, multi-model agents are a FinOps nightmare. You need attribution infrastructure before they scale. - The prompt inefficiency tax: A 4,000-token system prompt that should be 800 tokens is a 5x cost multiplier on every API call. Prompt caching cuts costs by up to 90%. Intelligent routing cuts spend by 30% without losing accuracy. Your FinOps dashboard can't see any of this right now. - Cloud price increases coming: Dell raised server prices 15–20%. OVH already announced 5–10% increases for April–September 2026. AWS, Azure, and GCP typically follow 3–6 months behind. IDC warns G1000 organizations will face a 30% rise in underestimated AI infrastructure costs by 2027. Teams without cost baselines won't be able to explain what hit them. Here's the catch: the automation tools for traditional cloud costs took eight years to reach mainstream adoption. The tools for AI inference cost governance were built in January 2026. The question is whether the FinOps community replays that eight-year lag—or finally learns from it. Research from: State of FinOps 2026 (FinOps Foundation), Flexera 2026 State of the Cloud, Forrester Public Cloud Market Outlook 2026, IDC FutureScape 2026, AWS product documentation, and Finout industry analysis. CloudCostChefs — FinOps knowledge, served fresh. https://cloudcostchefs.com

    21 min
  4. 13 MAR

    EP5: The AI Cost Pincer — Runaway Agents & Microsoft's Mandatory 25% Tax

    Two AI agents get stuck arguing with each other for 11 days. The reported bill: tens of thousands of dollars. Zero useful output. Gartner just named this pattern "Agentic Resource Exhaustion" at their March 2026 Summit — and predicted 40% of agentic AI projects will be canceled by 2027. Meanwhile, Microsoft's cascading pricing changes could impose a 12 to 25% increase on Enterprise Agreements — and the FTC is investigating. In this episode, we tackle the two forces squeezing FinOps teams from both sides.Topic 1: The Agentic AI Cost Crisis. 98% of FinOps teams now manage AI spend. Fewer than half have financial guardrails. Autonomous agents are provisioning their own compute, spinning in recursive loops, and burning budgets overnight. We break down a real-world multi-agent cost blowout, explain why agentic workflows consume 10-50x more tokens than simple queries, honestly assess where the data is strong versus shaky, and cover the governance tools — from TrueFoundry's Budget Grant model to OWASP's new "Denial of Wallet" vulnerability — that are emerging to contain the damage. Topic 2: Microsoft's AI Tax. Three compounding stages: EA volume discount elimination (up to 12%), forced Copilot bundling into M365 (5-33% per SKU), and the Unified Support multiplier that compounds on top. For the largest enterprises, $10M becomes $12.5M. Copilot has 3.3% penetration, 35.8% active usage, and negative NPS — but you're paying for it anyway. We cover Microsoft's $120B+ annual CapEx trajectory, the $80B unfulfillable Azure backlog, scrutinize the source bias, and explain why AWS and Google are running the same playbook. The thesis: whether you adopt AI or not, you're paying for it. Your agents are burning money. Your vendors are raising rent. The old FinOps playbook was built for deterministic compute. That world is ending. Data from Gartner, State of FinOps 2026, IDC, Recon Analytics, US Cloud, Info-Tech Research Group, Microsoft earnings, FTC filings, and practitioner case studies. No sponsors. No vendor partnerships. New episodes weekly covering the top FinOps stories with data, analysis, and zero industry spin. cloudcostchefs.com #FinOpsPodcast #AI #M365 #Copilot

    20 min
  5. 6 MAR

    EP4: The Repatriation Files & The Shift-Left Fantasy

    37signals saved $10 million by leaving AWS. GEICO saw costs explode 2.5x after migrating 600 applications. 86% of CIOs now plan to repatriate at least some workloads. But Zynga spent $100 million on data centers and crawled back to AWS. And 75% of repatriation projects fail. In this episode, we tackle the two biggest questions in FinOps right now — with the counterarguments included. Topic 1: The Repatriation Files. Three companies, three scales, three outcomes. We break down 37signals' $10M exit (and why their math doesn't apply to you), GEICO's enterprise-scale 2.5x cost explosion, Ahrefs' $400M claim (and the methodological problems nobody mentions), and Zynga's cautionary $100M failure. Plus the Barclays 86% stat — what it actually means vs. how it's being sold. Topic 2: The Shift-Left Fantasy. Pre-deployment cost estimation has been "coming next year" since 2023. The State of FinOps 2026 still lists it as the #1 requested capability — not the #1 delivered capability. Three years and a $4.6B acquisition later, the best the market has is a public preview. The reason goes deeper than tooling: you literally cannot prove prevention. An engineer who avoids $200K in waste gets zero credit because the savings never appear on a dashboard. We explain the prevention paradox, why federation doesn't work without tooling, and the four things that would actually need to change. Data from The Register, The Stack, Barclays CIO Survey, State of FinOps 2026, MarketsandMarkets, DHH, Michael Gat, and practitioner case studies. No sponsors. No vendor partnerships.New episodes weekly covering the top FinOps stories with data, analysis, and zero industry spin.cloudcostchefs.com #CloudRepatriation #37signals #Zynga #Shift-Left #FinOps

    22 min
  6. 27 FEB

    EP3: Self-Fund AI — The Impossible Mandate Breaking FinOps Teams

    "Find savings in cloud to fund AI." That's the mandate crushing FinOps teams in 2026. The State of FinOps 2026 report dropped last week. 96,000 members. 98% of teams managing AI spend. The Foundation changed its mission from "Cloud" to "Technology." The headline story: adoption is at all-time highs. The buried story: practitioners are drowning. In this episode, we pick up where Episode 1 left off with the fresh 2026 data, then dive deep into the mandate nobody's talking about honestly. First: The 2026 Report Card. The Foundation expanded FinOps to cover AI, SaaS, licensing, private cloud, and data centers — 5x the scope. But waste is still 27-35%, maturity is still at "Crawl," and the response was to rename the category, not fix the problem. Second: The Self-Fund AI Mandate. Teams of 8-10 people are being told to squeeze savings from already-optimized cloud estates to fund AI initiatives. Meanwhile, 80% of companies miss AI cost forecasts by 25%+. Inference costs run 15-20x training costs. AWS quietly hiked GPU prices 15% on a Saturday. And the vendors' solution? AI tools to manage AI costs. We cover the Mavvrik survey (84% margin erosion from AI), the inference cost bomb ($150M training → $2.3B inference), the GPU price hike nobody announced, and the 5 things that actually work — including why the best teams are pushing back on the mandate itself. Data from the FinOps Foundation, Mavvrik, Apptio, AWS, Flexera, CloudBolt, Anyscale, and practitioner discussions. No sponsors. No vendor partnerships. New episodes weekly covering the top FinOps stories with data, analysis, and zero industry spin.cloudcostchefs.com #finops #cloudcost #cloudwaste #aicostmanagement

    18 min

Acerca de

CloudCostChefs is the weekly show that turns sky-high cloud bills into bite-size savings. In 10 fast minutes you’ll get no-fluff news, hand-tested optimization “recipes,” and automation hacks that keep workloads lean, fast, and budget-friendly—across AWS, Azure, GCP, OCI, and more. Hosted by cost-obsessed cloud engineers, each episode arms you with actionable tips you can run today plus the tools that make your CFO do a happy dance. Aprons on, cloud-cost warriors—let’s get cooking!