Platform Engineering Playbook Podcast

vibesre

The Platform Engineering Playbook Podcast is where AI meets open-source infrastructure knowledge—and you're part of the editorial process. Every episode is researched, scripted, and produced with AI, then reviewed by the community and published on GitHub for anyone to improve. Facing tool sprawl across 130+ platforms? Justifying PaaS costs to your CFO? Navigating the Shadow AI crisis hitting 85% of organizations? We tackle the messy realities of platform engineering that most content avoids, delivering data-backed insights and decision frameworks you can use Monday morning. Built for senior engineers, SREs, and DevOps practitioners with 5+ years in production, we dissect cloud economics, AI governance, infrastructure trade-offs, and career strategy—with the receipts to back it up. Think we got something wrong? Have better data? Open a pull request at platformengineeringplaybook.com. This is infrastructure podcasting as a living document, where the community keeps us honest and the content gets better with every contribution. Read the playbook at https://platformengineeringplaybook.com

  1. 16 HR AGO

    The Next Platform Engineer: AI + Observability + FinOps

    **Is AI about to revolutionize how we build infrastructure? The CNCF CTO says we're not prepared for what's coming.** In this episode of Platform Engineering Playbook, we dive deep into the future of cloud native infrastructure and why 2026 might be the year everything changes. Based on Chris Aniszczyk's latest insights, we explore how AI agents are moving beyond just consuming our platforms to actively designing and managing them.

    **What You'll Learn:**
    • How AI is reshaping platform engineering workflows and decision-making
    • Why current Kubernetes evolution patterns may not be sustainable
    • Practical strategies for platform engineers to prepare for AI-driven infrastructure
    • Key takeaways from the CNCF's 2026 observability trends

    **Episode Chapters:**
    0:00 Cold Open - AI's Infrastructure Revolution
    2:15 Today's Platform Engineering News
    8:30 Deep Dive: CNCF CTO's 2026 Predictions
    15:45 Technical Analysis: Kubernetes at Scale

    Whether you're building internal developer platforms or managing cloud native infrastructure at scale, this episode provides actionable insights for navigating the intersection of AI and platform engineering.

    **Sources & References:**
    - State of cloud native 2026: CNCF CTO's insights: https://www.cncf.io/blog/2026/02/19/state-of-cloud-native-2026-cncf-ctos-insights-and-predictions/
    - CNCF 2026 Observability Summit Schedule: https://www.cncf.io/announcements/2026/02/18/cncf-releases-2026-observability-summit-north-america-schedule-as-cloud-native-observability-adoption-expands/
    - DevOps Modernization with AI Agents: https://www.infoq.com/presentations/devops-modernization-ai-agents/?utm_campaign=infoq_content&utm_source=infoq&utm_medium=feed&utm_term=global
    - Amazon Connect Cases AWS Service Quotas: https://aws.amazon.com/about-aws/whats-new/2026/02/amazon-connect-cases-aws-service-quotas
    - Cloudflare HTTP 5xx Errors Incident: https://www.cloudflarestatus.com/incidents/xhmtd6x13cw1

    #PlatformEngineering #DevOps #CloudNative #Kubernetes

    18 min
  2. 1 DAY AGO

    Ray + Kubernetes: The Production AI Stack Explained

    **Why do 92% of ML models never reach production?** It's not a code problem; it's a platform engineering problem. In today's episode of Platform Engineering Playbook, we tackle the massive infrastructure gap that's keeping AI initiatives stuck in notebooks while your data science teams wonder why their brilliant models never see the light of day.

    **What You'll Learn:**
    ✅ The real reasons ML models fail to reach production (hint: it's your infrastructure)
    ✅ How to architect production-ready AI infrastructure using Ray on Kubernetes
    ✅ Practical strategies for platform engineers supporting data science teams
    ✅ Enterprise GitOps scaling from single clusters to fleet management

    **Episode Breakdown:**
    0:00 Cold Open - The 92% problem
    2:15 Industry News Roundup
    8:30 Deep Dive: From Notebooks to Production
    15:45 Architecture Analysis: Ray on Kubernetes

    **Today's Platform Engineering News:**
    • Datadog's new audit-ready compliance reporting
    • Amazon Bedrock transforming HR talent acquisition
    • The hidden cost of burning out your on-call engineers
    • Enterprise GitOps fleet management strategies

    Whether you're struggling with ML infrastructure or just want to stay ahead of platform engineering trends, this episode gives you actionable insights you can implement today.

    **Sources & References:**
    - From notebooks to nodes: Architecting production-ready AI infrastructure: https://thenewstack.io/production-ai-infrastructure-guide/
    - Generate audit-ready vulnerability and compliance reports with Datadog Sheets: https://www.datadoghq.com/blog/audit-reports-datadog-sheets/
    - AI meets HR: Transforming talent acquisition with Amazon Bedrock: https://aws.amazon.com/blogs/machine-learning/ai-meets-hr-transforming-talent-acquisition-with-amazon-bedrock/
    - Is your on-call rotation quietly burning out top talent?: https://thenewstack.io/sustainable-on-call-strategies/
    - How to scale GitOps in the enterprise: From single cluster to fleet management: https://platformengineering.org/blog/how-to-scale-gitops-in-the-enterprise

    #PlatformEngineering #DevOps #CloudNative #Kubernetes

    18 min
  3. 2 DAYS AGO

    Replace 5 Databases with 1? SurrealDB for AI Agents Explained

    Your AI agents are using five different databases right now - and you don't even know it. This database sprawl is silently killing your platform's performance and your team's sanity. In today's Platform Engineering Playbook, we dive deep into SurrealDB's multi-model approach and how it's revolutionizing AI infrastructure. Plus, breaking news on vulnerability management patterns that every platform engineer needs to understand.

    **What You'll Learn:**
    • Why database proliferation is the hidden killer of AI agent performance
    • SurrealDB's architecture deep dive and real-world deployment strategies
    • When (and when NOT) to consolidate your AI infrastructure databases
    • The contextual SBOM pattern transforming vulnerability management
    • India's massive $200B AI infrastructure play and what it means for the industry

    **Timestamps:**
    0:00 Cold Open - The Database Sprawl Crisis
    2:15 SurrealDB Deep Dive - Architecture & Implementation
    15:30 Practical Takeaways - When to Use Multi-Model Databases

    **Why Listen?** Get actionable insights from real platform engineering challenges, not theoretical fluff. We break down complex infrastructure decisions into practical guidance you can implement today. Perfect for platform engineers, DevOps teams, and infrastructure architects building scalable AI systems.

    **Sources & References:**
    • SurrealDB Docker Extension: https://www.docker.com/blog/deploy-surrealdb-docker-desktop-extension/
    • Spectral Collapse in Diffusion Inversion: https://arxiv.org/abs/2602.13303
    • India AI Infrastructure Investment: https://techcrunch.com/2026/02/17/india-bids-to-attract-over-200b-in-ai-infrastructure-investment-by-2028/
    • Contextual SBOM Pattern: https://developers.redhat.com/articles/2026/02/17/how-contextual-sbom-pattern-improves-vulnerability-management

    #PlatformEngineering #DevOps #CloudNative #Kubernetes

    19 min
  4. 3 DAYS AGO

    Agoda’s API Agent Turns Any API into MCP — No Code, No Deployments

    **What if API integration nightmares could disappear without writing a single line of code?** Agoda just dropped a game-changing solution that transforms any API into MCP (Model Context Protocol) with zero deployments - and it's about to reshape how platform teams approach AI integrations. In today's Platform Engineering Playbook, we break down this revolutionary no-code approach and explore what it means for enterprise platform strategies. Plus, we dive into Docker's latest sandbox capabilities with NanoClaw, performance testing breakthroughs for Identity Management systems using encrypted DNS in OpenShift, and the emerging patterns for running AI coding agents on Kubernetes.

    **What You'll Learn:**
    ✅ How Agoda's API Agent eliminates integration complexity
    ✅ The three-layer architecture powering zero-code API transformations
    ✅ Real-world implications for platform engineering teams
    ✅ Docker's new sandboxing capabilities for secure code execution
    ✅ Advanced load testing strategies for IdM systems with eDNS and CoreDNS

    **Timestamps:**
    00:00 Cold Open - The API Integration Revolution
    02:15 Deep Dive Act 1 - Agoda's Game-Changing Approach
    08:30 Deep Dive Act 2 - Architecture Deep Dive
    15:45 Deep Dive Act 3 - Platform Team Takeaways

    Perfect for platform engineers, DevOps teams, and technical leaders navigating the AI-platform integration landscape.

    **Sources & References:**
    - Agoda's API Agent: https://www.infoq.com/news/2026/02/agoda-api-agent/
    - Docker NanoClaw Sandboxes: https://www.docker.com/blog/run-nanoclaw-in-docker-shell-sandboxes/
    - IdM Load Testing with eDNS: https://developers.redhat.com/articles/2026/02/16/load-testing-idm-edns-coredns-openshift
    - Kubernetes for AI Agents: https://cloudnativenow.com/features/gas-town-what-kubernetes-for-ai-coding-agents-actually-looks-like/

    #PlatformEngineering #DevOps #CloudNative #Kubernetes

    19 min
  5. 4 DAYS AGO

    LocalStack Kills Community Edition: What Breaks in March

    **LocalStack just killed their open-source edition - but what does this really mean for your platform engineering stack?** In today's episode of Platform Engineering Playbook, we break down LocalStack's decision to discontinue its Community Edition and what it means for teams relying on local AWS development. Plus, we dive into the ripple effects across the developer ecosystem and provide a practical decision framework for your next moves.

    **What You'll Learn:**
    • Why LocalStack's pricing shift from free to $39/month matters for platform teams
    • Decision frameworks for evaluating local development alternatives
    • How AI is revolutionizing code deployment at Spotify
    • The surprising exodus from computer science programs and where students are heading
    • Insider Claude Code tips from the engineer who built it
    • Why Hollywood is concerned about Seedance 2.0's video generation capabilities

    **Episode Chapters:**
    0:00 Cold Open - LocalStack's Open Source Bombshell
    2:15 Deep Dive Act 1 - The Setup
    8:30 Deep Dive Act 2 - Pricing Analysis & Impact

    Whether you're managing platform infrastructure or building developer tooling, this episode gives you the insights and frameworks to navigate these industry shifts strategically.

    **Sources & References:**
    • LocalStack Community Edition Concerns: https://www.infoq.com/news/2026/02/localstack-aws-community/?utm_campaign=infoq_content&utm_source=infoq&utm_medium=feed&utm_term=global
    • Seedance 2.0 Hollywood Controversy: https://techcrunch.com/2026/02/15/hollywood-isnt-happy-about-the-new-seedance-2-0-video-generator/
    • Spotify AI Code Deployment: https://nextunicorn.ventures/ai-revolutionizes-code-deployment-at-spotify/
    • CS Student Exodus Analysis: https://techcrunch.com/2026/02/15/the-great-computer-science-exodus-and-where-students-are-going-instead/
    • Claude Coding Tips: https://www.anup.io/35-claude-code-tips-from-the-guy-who-built-it/

    #PlatformEngineering #DevOps #CloudNative #Kubernetes

    16 min
  6. 13 FEB

    OpenTofu vs Terraform: What Enterprise Teams Are Actually Doing (2026)

    **Is your infrastructure strategy about to become obsolete?** By 2025, half of all Terraform installations could be running OpenTofu - and the implications for platform engineering teams are massive. In today's deep dive, we break down the OpenTofu vs. Terraform battle that's reshaping infrastructure as code. You'll learn the real mechanics behind migrating between these tools, practical decision frameworks for enterprise teams, and why this choice could define your platform's next five years.

    **What You'll Learn:**
    • The technical and business drivers behind the OpenTofu fork
    • Step-by-step migration strategies and gotchas to avoid
    • How to evaluate which tool fits your team's needs
    • Real-world implications for existing Terraform workflows

    **Episode Chapters:**
    0:00 Cold Open - The OpenTofu prediction
    2:15 Today's Platform Engineering News
    8:30 Deep Dive Act 1 - Understanding the OpenTofu vs Terraform landscape

    **Plus:** AWS Elastic Beanstalk's new GitHub Actions integration, Dapr Runtime updates, and scaling secure DevOps practices across enterprise teams.

    Perfect for platform engineers, DevOps practitioners, and infrastructure leaders navigating the evolving IaC landscape.

    **Sources & References:**
    • OpenTofu vs. Terraform Guide: https://www.env0.com/blog/opentofu-vs-terraform-a-practical-guide-for-enterprise-infrastructure-teams
    • AWS Elastic Beanstalk GitHub Actions: https://aws.amazon.com/about-aws/whats-new/2026/02/aws-elastic-beanstalk-github-action
    • Dapr Runtime v1.17.0-rc.6: https://github.com/dapr/dapr/releases/tag/v1.17.0-rc.6
    • ByteDance AI Video Generator: https://www.theverge.com/ai-artificial-intelligence/877931/bytedance-seedance-2-video-generator-ai-launch
    • Secure DevOps at Scale: https://devops.com/secure-devops-at-scale-integrating-sre-devsecops-and-compliance/

    #PlatformEngineering #DevOps #CloudNative #Kubernetes

    18 min
  7. 12 FEB

    Why Databases Inside Kubernetes Are Becoming Technical Debt

    **Is running databases in Kubernetes about to become legacy technical debt overnight?** By 2026, the inference cloud revolution is forcing platform engineers to completely rethink database architecture - and the implications are massive. In today's deep dive, we break down the "container paradox" that's reshaping how we think about stateful workloads in Kubernetes. You'll discover why the rise of AI inference is making traditional database-in-K8s patterns unsustainable and what this means for your platform strategy.

    **What You'll Learn:**
    • Why the inference cloud demands decoupled database architectures
    • A practical framework for assessing your statefulness spectrum
    • How operator complexity is becoming a hidden cost center
    • Real-world lessons from World Bank's hybrid cloud transformation with Terraform

    **Key Topics Covered:**
    • The container paradox driving database architecture changes
    • Kubernetes cultural shifts enabling AI expansion
    • CloudFront's new mTLS authentication for zero trust architectures
    • Latest developments in federated variational inequalities

    **Timestamps:**
    0:00 Cold Open - The 2026 Database Prediction
    2:15 Platform Engineering News Roundup
    8:30 Deep Dive: The Container Paradox
    15:45 Operator Complexity Analysis

    Perfect for platform engineers, DevOps teams, and infrastructure architects navigating the evolving Kubernetes landscape.

    **Sources & References:**
    • The Container Paradox: https://www.digitalocean.com/blog/the-container-paradox-k8s-databases
    • World Bank Terraform Case Study: https://www.hashicorp.com/blog/how-world-bank-manages-hybrid-cloud-complexity-with-terraform
    • Kubernetes AI Culture Impact: https://www.infoq.com/news/2026/02/kubernetes-ai-culture-impact/
    • CloudFront mTLS Update: https://www.infoq.com/news/2026/02/amazon-cloudfront-mtls-origins/
    • Federated Variational Inequalities: https://arxiv.org/abs/2602.09164

    #PlatformEngineering #DevOps #CloudNative #Kubernetes

    18 min
  8. 11 FEB

    47% of CNCF Projects Slowed Down in 2025 — Why That’s Actually Good News

    **Why did 47% of CNCF projects slow down their development velocity in 2025, and why should platform engineers celebrate this trend?** In today's Platform Engineering Playbook, we decode what declining commit velocity across cloud native projects actually reveals about infrastructure maturity and what it means for your platform strategy.

    **What You'll Learn:**
    • How to interpret CNCF project velocity metrics as leading indicators for platform decisions
    • Why slower development cycles might signal stronger, more stable infrastructure foundations
    • Strategic insights for platform engineers navigating the evolving cloud native landscape
    • Breaking analysis of agentic AI transforming DevOps automation and autonomous infrastructure

    **Episode Breakdown:**
    0:00 Cold Open - The 47% velocity drop revelation
    2:15 Today's Platform Engineering News Roundup
    8:30 Deep Dive Act 1 - Decoding CNCF Velocity Data

    **Today's News Coverage:**
    - Harness unveils agentic AI for autonomous infrastructure management
    - TiDB's emergence as the first truly AI-native database
    - Oracle Cloud's new DevSecOps-as-a-Service offering
    - Ex-Google team building revolutionary video data infrastructure

    Whether you're architecting platforms or optimizing cloud native workflows, this episode delivers the strategic insights you need to stay ahead of infrastructure trends.

    **Sources & References:**
    - CNCF Project Velocity Analysis: https://www.cncf.io/blog/2026/02/09/what-cncf-project-velocity-in-2025-reveals-about-cloud-natives-future/
    - Agentic AI in DevOps: https://www.harness.io/blog/agentic-ai-in-devops-the-architects-guide-to-autonomous-infrastructure
    - TiDB AI-Native Database: https://thenewstack.io/tidb-and-the-rise-of-the-ai-native-database/
    - Oracle DevSecOps-as-a-Service: https://about.gitlab.com/blog/devsecops-as-a-service-on-oracle-cloud-infrastructure-by-data-intensity/
    - Video Data Infrastructure: https://techcrunch.com/2026/02/09/ex-googlers-are-building-infrastructure-to-help-companies-understand-their-video-data/

    #PlatformEngineering #DevOps #CloudNative #Kubernetes

    19 min

