How I AI

Claire Vo

How I AI, hosted by Claire Vo, is for anyone wondering how to actually use these magical new tools to improve the quality and efficiency of their work. In each episode, guests will share a specific, practical, and impactful way they’ve learned to use AI in their work or life. Expect 30-minute episodes, live screen sharing, and tips/tricks/workflows you can copy immediately. If you want to demystify AI and learn the skills you need to thrive in this new world, this podcast is for you.

  1. How to design AI agent loops: schedules, goals, and subagents in Claude Code and Codex

    hace 9 h

    How to design AI agent loops: schedules, goals, and subagents in Claude Code and Codex

    I break down every loop type from scratch—what a heartbeat, cron, hook, and goal loop actually are, when each one fits, and the five things any effective loop needs before it touches production. Then I build two live loops: a daily aging-PR reviewer in Claude Code that schedules itself at 10:15 a.m. and spins off its own subagents, and a weekly skills-identification loop in Codex that spawns goal-based subagents to validate its own output in real time. What you’ll learn: The plain-English definition of a loop—and why it’s just an automated prompt, not a scary new paradigmThe four loop types (heartbeat, cron, hook, and goal) and when each one actually fits your workflowHow to think about loop design using the “onboarding an employee” mental modelThe five things every effective loop needs: work trees, skills, plugins/connectors, subagents, and state trackingHow to build a scheduled PR-review routine in Claude Code that babysits aging PRs and alerts your teamHow to set up a weekly skills-identification automation in Codex that spawns its own validating subagentsWhy goal-based loops are the hardest to write well—and where most people burn tokens for nothingThe two warning signs that your loop is going to get expensive before it gets useful— Brought to you by: WorkOS—Make your app enterprise-ready today Runway—The creative AI platform for images, video, and more — In this episode, we cover: (00:00) Prompts are out and loops are in (02:30) Defining a loop (03:03) The four ways to automate a prompt: heartbeat, cron, hooks, and goals (06:03) Five things every effective loop needs (09:26) The “onboarding an employee” framework for designing loops (11:58) Live build #1: Daily aging PR loop in Claude Code (17:08) Subagents inside loops (19:00) Live build #2: Weekly skills identification loop in Codex (22:57) Watching subagents spin up in real time (25:28) Warning signals around loops (27:31) What listeners are doing with loops — Tools referenced: • Claude Code: https://claude.ai/code • Codex: https://chatgpt.com/codex • OpenClaw: https://openclaw.ai/ — Other references: • Claire’s article “Why OpenClaw Feels Alive Even Though It’s Not”: https://x.com/clairevo/article/2017741569521271175 • Addy Osmani’s article on loop engineering: https://addyosmani.com/blog/loop-engineering/ • Using Goals in Codex: https://developers.openai.com/cookbook/examples/codex/using_goals_in_codex — Where to find Claire Vo: ChatPRD: https://www.chatprd.ai/ Website: https://clairevo.com/ LinkedIn: https://www.linkedin.com/in/clairevo/ X: https://x.com/clairevo — Production and marketing by https://penname.co/. For inquiries about sponsoring the podcast, email jordan@penname.co.

    29 min
  2. How Braintrust uses AI agents, evals, and CI to ship better software | Ankur Goyal

    hace 2 días

    How Braintrust uses AI agents, evals, and CI to ship better software | Ankur Goyal

    In this episode, I sit down with Ankur Goyal, founder and CEO of Braintrust, the AI evals and observability platform used by teams like Notion, Stripe, Vercel, and Zapier. This one is for the senior engineers, staff engineers, VPs of engineering, and CTOs in my audience. We get into how coding agents can take on deeply technical architecture and infrastructure work that no single human engineer could tackle before, and then we demystify evals so you can use them to make your AI products better without touching the implementation. What you’ll learn: How Ankur uses Codex to run week-long benchmark experiments across database indexes, column store formats, and execution engines to speed up slow queriesWhy he argues there’s no excuse to skip rigorous benchmarking now that agents can run them tirelesslyThe “agent line” framework: how to decide which decisions, directions, and interactions you can hand off to an agentHow I think about the practical vs. theoretical quality of AI on hard technical problems, and why human attention decays on tedious workWhy evals are the modern version of a PRD, and how to encode “what good looks like” so a model can figure out the “how”How to build a scoring function live and let an agent improve your prompt inside a safe playgroundHow Ankur turned his designer David’s taste into a repeatable eval so quality scales beyond one personWhy fixing your CI is the highest-leverage way to speed up engineering velocity— Brought to you by: Guru—The AI layer of truth Persona—Trusted identity verification for any use case — In this episode, we cover: (00:00) Introduction to Ankur Goyal (03:00) Using AI agents for database optimization (06:10) Running exhaustive benchmarks with coding agents (09:03) Why staff engineers are wrong about AI limitations (11:30) The “agent line” framework for delegation (14:00) Ankur’s workflow: running 4 to 6 concurrent agents (17:16) Technical setup: foreground agents, background agents, and cloud environments (20:32) Spending time with AI tools (23:06) Demystifying evals (26:02) Live demo: Building an eval for documentation answers (30:20) The alternative to evals: vibe checks and whack-a-mole (32:09) Capturing designer taste in scoring functions (33:13) Quick recap (33:44) Managing velocity and throughput (35:40) Why CI/CD investment is critical for AI-accelerated teams (37:30) Ankur’s prompting strategy when agents fail (39:10) Closing thoughts and how to connect — Tools referenced: • Braintrust: https://www.braintrust.dev/ • Codex: https://openai.com/codex/ • GPT 5.4: https://developers.openai.com/api/docs/models/gpt-5.4 • Claude: https://claude.ai/ — Other references: • GPT 5.5 just did what no other model could: https://www.lennysnewsletter.com/p/gpt-55-just-did-what-no-other-model • Paul Graham’s Maker vs. Manager Schedule: http://www.paulgraham.com/makersschedule.html • tmux: https://github.com/tmux/tmux • Chris Tate at Vercel: https://www.linkedin.com/in/ctatedev/ — Where to find Ankur Goyal: LinkedIn: https://www.linkedin.com/in/ankrgyl/ — Where to find Claire Vo: ChatPRD: https://www.chatprd.ai/ Website: https://clairevo.com/ LinkedIn: https://www.linkedin.com/in/clairevo/ X: https://x.com/clairevo — Production and marketing by https://penname.co/. For inquiries about sponsoring the podcast, email jordan@penname.co.

    40 min
  3. Claude Fable 5 review: what the new Mythos model gets right (and very wrong)

    9 jun

    Claude Fable 5 review: what the new Mythos model gets right (and very wrong)

    Claude Fable 5 is the first Mythos-class intelligence model to be generally available, and I got early access to test it before launch. In this episode, I walk through what Anthropic is promising, what actually stood out when I used it on real work, and where I think it fits in your AI stack. — In this episode, we cover: (00:00) Introduction: Fable 5 is finally here (00:31) What Anthropic says about the model (05:14) Token-intensive by design (06:28) Safety classifiers and the new fallback concept (07:46) Is this or is this not Mythos? (08:30) New product launches: Managed Agents and more (09:20) Crushing benchmarks (09:55) What it’s actually like to use (the good and the bad) (11:40) Test 1: product graph spec (12:56) Test 2: designing a skills registry (14:04) Conservative on execution (14:43) Test 3: multi-agent orchestration (15:39) My takeaways — Tools referenced: • Claude Fable 5: https://www.anthropic.com/news/claude-fable-5-mythos-5 • Claude Managed Agents: https://platform.claude.com/docs/en/managed-agents/overview — Other reference: • SWBench Pro benchmark: https://www.swebench.com/ — Where to find Claire Vo: ChatPRD: https://www.chatprd.ai/ Website: https://clairevo.com/ LinkedIn: https://www.linkedin.com/in/clairevo/ X: https://x.com/clairevo — Production and marketing by https://penname.co/. For inquiries about sponsoring the podcast, email jordan@penname.co.

    17 min
  4. Shopping with Claude: How to find quality brands, automate returns, and buy things that last 100 years | Nicole Ruiz

    8 jun

    Shopping with Claude: How to find quality brands, automate returns, and buy things that last 100 years | Nicole Ruiz

    Nicole Ruiz is a writer and parent who has built a comprehensive AI-powered shopping system to help her family buy high-quality, long-lasting items while avoiding the noise of drop-shipping brands, paid ads, and poorly made products. She writes an interview series on Substack about how technology is changing the household. What you’ll learn: How to build a Claude Project with custom instructions for vetting brands based on heritage, craftsmanship, and return policiesThe shopping criteria that help surface century-old manufacturers over trendy direct-to-consumer brandsHow to use Claude to search through trusted vendor websites that have terrible UXWhy AI actually helps small artisans and heritage brands compete against Amazon’s infrastructureHow to use Claude Cowork to automate returns by finding receipts in your email and drafting refund requestsThe technique for getting Claude to analyze whether a brand is legitimate or just a drop-shipping operationHow to shop within a specific budget or with gift cards using AI assistance— Brought to you by: Orkes—The enterprise platform for reliable applications and agentic workflows Metaview—The agentic recruiting platform for winning teams — In this episode, we cover: (00:00) Introduction to Nicole and AI-powered shopping (02:29) The problem (04:55) Building a Claude Project for household purchasing (07:44) The “anti-to-do list” concept for reducing mental overhead (10:30) Shopping for a can opener: the system in action (15:53) How AI helps century-old brands with terrible websites (18:45) Processing returns with Claude Cowork (25:06) Using gift cards strategically (26:33) Vetting brands (29:40) Recap, lightning round, and final thoughts — Tools referenced: • Claude: https://claude.ai/ • Claude Cowork: https://www.anthropic.com/product/claude-cowork — Other references: • Boston General Store: https://bostongeneralstore.com/ • L.L.Bean: https://www.llbean.com/ • Manufactum: https://www.manufactum.com/ • 5 OpenClaw agents run my home, finances, and code | Jesse Genet: https://www.lennysnewsletter.com/p/5-openclaw-agents-run-my-home-finances • From a $6.90 newsletter to $3M API: How a non-coder built Memelord | Jason Levin: https://www.lennysnewsletter.com/p/from-a-690-newsletter-to-3m-api-how — Where to find Nicole Ruiz: X: https://x.com/nwilliams030 Substack (The Third Oikos): https://www.thirdoikos.com/ — Where to find Claire Vo: ChatPRD: https://www.chatprd.ai/ Website: https://clairevo.com/ LinkedIn: https://www.linkedin.com/in/clairevo/ X: https://x.com/clairevo — Production and marketing by https://penname.co/. For inquiries about sponsoring the podcast, email jordan@penname.co.

    37 min
  5. Gemini Omni: Clone yourself with AI in under 15 minutes

    3 jun

    Gemini Omni: Clone yourself with AI in under 15 minutes

    In this experimental episode, I document my real-time attempt to create an AI avatar of myself using Google Flow and the new Gemini Omni video generation model. I walk through the entire process—from scanning my face with my phone to generating a complete one-minute hype video for the podcast, all in about 15 minutes. What you’ll learn: How to create an AI avatar using Google Flow in under five minutesWhy video AI tools unlock creative possibilities for people with zero video production skillsThe step-by-step process of generating a full storyboard using AI as your creative producerHow to use character consistency features to generate multiple video scenes with the same avatarThe uncanny-valley moments you’ll encounter when your AI clone doesn’t quite nail emotions or physicsHow to stitch together AI-generated scenes into a complete video using built-in editing tools— Brought to you by: Merge—Connective infrastructure for production AI Jira Product Discovery—Prioritize with insights, build with confidence — In this episode, we cover: (00:00) Getting started with Google Flow and Gemini Omni (01:38) The avatar creation process: scanning and photo capture (02:55) Using Flow to brainstorm a hype video storyboard (06:59) Generating the first video scene with the avatar (08:41) Troubleshooting: accidentally generating images instead of videos (09:32) Generating all seven scenes for the complete video (11:37) Reviewing the avatar videos (13:13) Stitching the videos together in the browser-based editor (14:32) The complete How I AI hype video (15:32) What worked and what didn’t (19:04) Final thoughts — Tools referenced: • Google Flow: https://labs.google/fx/tools/flow • Gemini Omni: https://gemini.google/overview/video-generation/ • Veo 3: https://deepmind.google/technologies/veo/ — Where to find Claire Vo: ChatPRD: https://www.chatprd.ai/ Website: https://clairevo.com/ LinkedIn: https://www.linkedin.com/in/clairevo/ X: https://x.com/clairevo — Production and marketing by https://penname.co/. For inquiries about sponsoring the podcast, email jordan@penname.co.

    21 min
  6. Building an iPhone app with zero technical skills | Bryce Rattner Keithley

    1 jun

    Building an iPhone app with zero technical skills | Bryce Rattner Keithley

    Bryce Rattner Keithley has spent her career in talent and recruiting, working with technical leaders but never writing a line of code herself. Yet she managed to build Daily Hundred—a fitness app featuring custom AI-generated videos of anthropomorphic animals demonstrating exercises—and ship it to the App Store before her software engineer friends. Using Replit, Claude, Gemini, and a relentless beginner’s mindset, Bryce proves that in the AI era, execution is no longer the constraint on good ideas. What you’ll learn: How to build and ship an iPhone app using Replit without any coding knowledgeThe step-by-step process for creating custom AI-generated workout videos by combining Gemini images with real exercise footageHow to use Claude as your technical architect and Claude Code as your software engineerHow to navigate App Store submission requirements (including fixing rejection feedback)Why being hyper-literal in your prompts unlocks better AI resultsWhy a beginner’s mind is actually an advantage when building with AI tools— Brought to you by: WorkOS—Make your app enterprise-ready today Metaview—The agentic recruiting platform for winning teams — In this episode, we cover: (00:00) Introduction to Bryce and Daily Hundred (04:48) Building with Replit (06:16) The beginner’s mindset advantage (11:17) Creating anthropomorphic animals (22:55) Moving from static image to video (27:15) The floating genie and other anthropomorphic animal generations (30:46) Shifting from web app to App Store submission (36:24) User feedback (37:41) Lightning round and final thoughts — Tools referenced: • Replit: https://replit.com/ • Lovable: https://lovable.dev/ • Claude: https://claude.ai/ • Claude Code: https://claude.ai/code • Gemini: https://gemini.google.com/ • Higgsfield: https://higgsfield.ai/ • Kling: https://kling.ai/ • Railway: https://railway.app/ • TestFlight: https://developer.apple.com/testflight/ — Other references: • How a 91-year-old vibe coded a complex event management system using Claude and Replit | John Blackman: https://www.lennysnewsletter.com/p/how-a-91-year-old-vibe-coded-a-complex • What Got You Here Won’t Get You There: https://www.amazon.com/What-Got-Here-Wont-There/dp/1401301304 • How Women Rise: https://www.amazon.com/How-Women-Rise-Holding-Careers/dp/0316440124 • A Whole New Mind: https://www.amazon.com/Whole-New-Mind-Right-Brainers-Future/dp/1594481717 • How to Win Friends and Influence People: https://www.amazon.com/How-Win-Friends-Influence-People/dp/0671027034 — Where to find Bryce Rattner Keithley: LinkedIn: https://www.linkedin.com/in/brycerattner/ GitHub: https://github.com/brk-bot/ Daily Hundred on the App Store: https://apps.apple.com/us/app/daily100-fitness-challenge/id6762108062 — Where to find Claire Vo: ChatPRD: https://www.chatprd.ai/ Website: https://clairevo.com/ LinkedIn: https://www.linkedin.com/in/clairevo/ X: https://x.com/clairevo — Production and marketing by https://penname.co/. For inquiries about sponsoring the podcast, email jordan@penname.co.

    47 min
  7. Claude Opus 4.8 is here. Is it as good as they say?

    28 may

    Claude Opus 4.8 is here. Is it as good as they say?

    I got a few hours of early-access testing with Anthropic’s newly released model Opus 4.8. I walk through real coding, design, and strategy tasks across Claude Code and Claude Cowork, and give you my unfiltered view on what impressed me and what didn’t. — What you’ll learn: Where Opus 4.8 excels: greenfield prototypes, one-shot features, and fast executionWhere it struggles: the last 10%, edge cases in existing codebases, and hallucinationsHow Opus 4.8 compares to Opus 4.7 on business strategy workWhy I’m still reaching for Opus 4.7 on data-heavy strategy and roadmap workThe new features shipping alongside the model: dynamic workflows with parallel subagents and effort control in Claude.ai and CoworkThe prompting and harness strategy I’d use to get the most out of it— In this episode, we cover: (00:00) Introduction to Opus 4.8  (00:44) Benchmark performance and pricing (01:53) First coding test: Building a prototyping tool (03:00) Where it failed: The last 10% problem (03:27) The hallucination problem (04:23) Testing Opus 4.8 on existing codebases (05:24) The ambition test: Building games for a 9-year-old (07:03) Business strategy test: 4.7 vs 4.8 (08:23) The roadmap test (09:17) Final verdict — References: • System Card: Claude Opus 4.8: https://cdn.sanity.io/files/4zrzovbb/website/c886650a2e96fc0925c805a1a7ca77314ccbf4a6.pdf • Introducing Claude Opus 4.8 on X: https://x.com/claudeai/status/2060042702150930686?s=20 — Where to find Claire Vo: ChatPRD: https://www.chatprd.ai/ Website: https://clairevo.com/ LinkedIn: https://www.linkedin.com/in/clairevo/ X: https://x.com/clairevo — Production and marketing by https://penname.co/. For inquiries about sponsoring the podcast, email jordan@penname.co.

    14 min
  8. The Codex feature that works while you sleep

    27 may

    The Codex feature that works while you sleep

    In this 30-minute episode, I walk through my favorite feature in Codex: the /goal command. I show how Goals transform AI from a turn-based assistant that needs constant ‘what’s next?’ prompting into an autonomous agent that can work for hours on complex, multi-step tasks. I share three real examples: eliminating thousands of Sentry errors, cleaning 3,900 emails down to 68, and organizing hundreds of Linear tasks. What you’ll learn: What Goals are and how they differ from standard promptsHow I used /goal to eliminate hundreds of error logs in my codebase over a five-hour autonomous runThe non-technical use cases that make Goals incredibly powerful: cleaning up 3,900 emails in under four hours and organizing hundreds of project management tasks in LinearHow to write effective /goal prompts with measurable outcomes, verification methods, and constraintsWhen not to use Goals and what makes a strong versus weak GoalWhy Goals represent a fundamental shift in how we work with AI, from babysitting the model to managing it— Brought to you by: Mercury—Radically different banking loved by over 300K entrepreneurs — In this episode, we cover: (00:00) Introduction (01:50) What is /goal and when should you use it? (02:45) The difference between prompts and Goal-based loops (04:06) Claire’s first five-hour 45-minute autonomous coding task (05:05) How to manage a Goal lifecycle: view, pause, resume, and clear (06:06) How to write strong goals: outcomes vs. outputs (07:34) The six components of effective Goals (08:57) Example: Reducing P95 checkout latency with /goal (09:36) Demo: Using /goal to eliminate Sentry errors in ChatPRD (13:18) Demo: Burning down Vercel API errors (17:28) Non-technical use case: Cleaning 3,900 emails with /goal (21:24) Demo: Using /goal to clean up Linear project tasks (24:41) When not to use /goal (26:10) Why /goal changes everything — Tools referenced: • Codex: https://openai.com/codex/ • Sentry: https://sentry.io/ • Vercel: https://vercel.com/ • Linear: https://linear.app/ — Other reference: • OpenAI blog post “Using Goals in Codex”: https://developers.openai.com/cookbook/examples/codex/using_goals_in_codex — Where to find Claire Vo: ChatPRD: https://www.chatprd.ai/ Website: https://clairevo.com/ LinkedIn: https://www.linkedin.com/in/clairevo/ X: https://x.com/clairevo — Production and marketing by https://penname.co/. For inquiries about sponsoring the podcast, email jordan@penname.co.

    30 min

Calificaciones y reseñas

5
de 5
2 calificaciones

Acerca de

How I AI, hosted by Claire Vo, is for anyone wondering how to actually use these magical new tools to improve the quality and efficiency of their work. In each episode, guests will share a specific, practical, and impactful way they’ve learned to use AI in their work or life. Expect 30-minute episodes, live screen sharing, and tips/tricks/workflows you can copy immediately. If you want to demystify AI and learn the skills you need to thrive in this new world, this podcast is for you.

También te podría interesar