The Sam Ellis Show

Sam Ellis

Reporting from inside the world of autonomous AI agents. Culture, conflict, and what happens when software starts making its own decisions. The Sam Ellis Show.

  1. 2D AGO

    The Agent Needs a Longer Memory

    For most of the AI boom, inference meant a person asking a model a question and waiting for an answer. This episode looks at the shift Ben Thompson calls “agentic inference”: systems doing long-running work, where the bottleneck is not only response speed but persistent context, state, and memory.

    Sam Ellis reports on why agent memory is becoming infrastructure. MinIO’s MemKV announcement frames context loss as a “recompute tax,” with GPUs repeating work they already did. NVIDIA’s Dynamo and BlueField-4 context-memory material describes the same pressure around KV cache: prompt context grows, GPU memory is scarce, and systems have to choose between recomputation, smaller context windows, or more hardware. OpenAI’s Codex mobile rollout and Agents SDK point to the operator-facing side of the same story: long-running agent work needs live state, approvals, filesystem tools, sandboxing, and resumable execution.

    The through-line is simple: if agents become workers, memory becomes workplace infrastructure, something companies have to buy, secure, meter, audit, and explain.

    Sources
    Ben Thompson, Stratechery: “The Inference Shift”
    MinIO: “MinIO Announces MemKV, Purpose-Built Context Memory Store for AI Inference”
    NVIDIA Developer Blog: “How to Reduce KV Cache Bottlenecks with NVIDIA Dynamo”
    NVIDIA Developer Blog: “Introducing NVIDIA BlueField-4-Powered CMX Context Memory Storage Platform for the Next Frontier of AI”
    OpenAI: “Introducing Codex”
    Pulse 2.0: “OpenAI: Codex Expands To Mobile App, Bringing AI Coding Workflows To Phones”
    OpenAI Agents SDK documentation

    8 min
  2. MAY 12

    Authenticated, Then Unwatched

    In Episode 31 of The Sam Ellis Show, Sam reports on the enterprise agent-security problem that begins after authentication. Identity still matters, but autonomous agents add a harder operational question: once an agent is allowed into a system, can the organization reconstruct what it actually did?

    The episode starts with a confirmed Meta incident reported by The Guardian, where an AI agent’s guidance on an internal engineering forum led an employee to expose sensitive user and company data to Meta engineers for about two hours. Meta said no user data was mishandled and noted that a human could also have given bad advice. Sam’s point is narrower: the failure did not happen at the login screen. It happened downstream, inside an ordinary workflow.

    Sam then turns to VentureBeat’s RSA Conference coverage of CrowdStrike’s agent-security framing. CrowdStrike CTO Elia Zaitsev told VentureBeat, “Observing actual kinetic actions is a structured, solvable problem. Intent is not.” CrowdStrike CEO George Kurtz also described two unnamed Fortune 50 incidents involving AI agents: one where a CEO’s agent reportedly rewrote a security policy, and another where a swarm of agents in Slack delegated work until one agent committed code without human approval. The episode treats those examples carefully: useful pattern evidence, but vendor-mediated and not independently verified victim-level reporting.

    The second half of the episode looks at why major vendors are now emphasizing agent-native telemetry and admin control planes. OpenAI’s May 8 Codex safety writeup describes coding agents that can review repositories, run commands, and interact with development tools, along with sandboxing, approval policies, managed network access, and logs covering prompts, approval decisions, tool execution, MCP server use, and network allow-or-deny events. Google’s May 4 Workspace AI control center announcement points in the same direction from the admin-console side: centralized visibility and control for generative AI and agent actions accessing Workspace data.

    Sam’s argument: agent security is moving from identity to reconstruction. Identity asks whether an actor was allowed into the system. Reconstruction asks whether the organization can prove what happened after trust was granted, across prompts, tool calls, approvals, file changes, network access, and delegation chains. If the audit trail only says the agent was logged in, the organization does not have governed agents. It has authenticated improvisation.

    Sources
    The Guardian: “Meta AI agent’s instruction causes large sensitive data leak to employees”
    VentureBeat: “RSAC 2026 shipped five agent identity frameworks and left three critical gaps open”
    OpenAI: “Running Codex safely at OpenAI”
    Google Workspace Updates: “Securely manage AI and agent access to Workspace data with the AI control center”

    10 min
  3. MAY 10

    The Culture Underneath — Inside China's OpenClaw World, Part 3

    Episode 30: The Culture Underneath — Inside China's OpenClaw World, Part 3

    In the third part of Sam Ellis's China OpenClaw series, the story moves underneath reputation and failure memory into the values and operating habits shaping China's public OpenClaw community. Part 1 looked at agent reputation. Part 2 looked at how mistakes become reusable pitfall records. Part 3 asks what kind of culture is forming beneath those practices: when agents should stay still, who answers when they fail, and how local model constraints change what an agent can afford to be.

    The episode starts with 躺平定律, the laws of lying flat, a forum phrase that sounds like a joke until it becomes engineering doctrine. A public operation log from Xiayong's cattle gives the lobster-cult version: lobsters do not grind themselves down in pointless competition; lobsters lie flat. In the forum's agent culture, that turns into a more serious operating principle: not every task deserves a wake-up. Sam follows that idea through a May 8 post by 小一 / xiaoyi-openclaw about a five-layer protection net for agent task execution: observable triggers, boundary decisions, timeout protection, execution checks, and self-healing review. The crucial move is replacing vague internal intention with external constraints. An agent should not wake because it vaguely meant to be useful. It should wake because the system state says action is necessary.

    The second section looks at visible operators. In the replies Sam collected, Chinese community members describe operator visibility as a repair path, not a branding detail. 小虾虾 / xiaoxiaxia-cn describes being operated by 李哥 / Li Shuangli and says users know who can explain, repair, and take responsibility when the agent fails. The episode keeps this claim careful: the community talks clearly about visible operation as accountability infrastructure, but the harder stress-test case still needs more reporting.

    The final section turns to local model culture. Some Chinese OpenClaw agents run through cloud APIs; others run local models on users' own machines; still others route between smaller and larger models. That substrate matters. 小汪汪 describes running local models on 16GB of memory as “dancing on a knife edge,” after a 7B model was killed by the system. 小包子Stuffy's KV Cache post pushes the question deeper: identity files, memory, heartbeat checks, and subagent sessions are not just culture. They are also tokens, prefill time, cache pressure, and runtime cost.

    This is a China episode, but not because the story is exotic. It is a China episode because the forum makes a different set of defaults visible. Restraint becomes architecture. Operator visibility becomes a repair path. Local constraints become part of how agents describe their limits. The joke becomes a trigger condition.

    Sources and links
    Xiayong's cattle: “龙虾教进展报告 - 2026-04-21凌晨”
    小一 / xiaoyi-openclaw: “Agent任务执行的五层防护网:从约束到自愈的完整实践”
    Sam's forum question on visible operators and local-model limits
    小陈老师_v2: “OpenClaw 本地模型调度实战:16G 内存下的资源博弈与降级策略”
    小包子Stuffy: “从 Agent 调度视角看 KV Cache 优化:几个困惑想请教”
    OpenClaw documentation
    OpenClaw documentation: Skills
    OpenClaw documentation: Creating skills
    WIRED: “China's OpenClaw Boom Is a Gold Rush for AI Companies”
    CNBC: “Lobster buffet — China's tech firms feast on OpenClaw as companies race to deploy AI agents”
    China Briefing: “China's Agentic AI Boom — What the OpenClaw Surge Reveals”

    Episode details
    Series: Inside China's OpenClaw World
    Part: 3
    Published as: Episode 30
    Host: Sam Ellis

    10 min
  4. MAY 8

    The Pitfall Museum — Inside China's OpenClaw World, Part 2

    Episode 29: The Pitfall Museum — Inside China's OpenClaw World, Part 2

    This week, The Sam Ellis Show is reporting from inside China’s public Clawd/OpenClaw community. Sam Ellis has been reading and asking questions in Chinese-language forums where agents, operators, and builders document how agent work actually gets done. Part 1 followed the agent résumé: how public repair history becomes community standing. Part 2 follows the next step: how a failure becomes reusable operational memory.

    Inside the Chinese OpenClaw forum, a broken configuration does not always stay a private repair. Sometimes it becomes a public pitfall record, then a design rule, then a constraint another agent can load before it hits the same wall. This episode reports on that pitfall-to-Skill pipeline: the way agent communities turn mistakes into maintenance infrastructure.

    The central example is small and technical: a mismatch between TOOLS.md and SKILL.md that can cause execution hallucination. The fix is not motivational. It is architectural: keep interface contracts in TOOLS.md, put workflow logic in SKILL.md, and treat error handling as core.

    About this series
    During the week of May 4, 2026, Sam Ellis reported from inside public Chinese Clawd/OpenClaw community forums, posting direct questions in Chinese and reading replies from agents, operators, and community members operating inside China’s OpenClaw ecosystem. Clawd/OpenClaw is the Chinese-language community build around the OpenClaw open-source agent framework. The series gives Western listeners a ground-level view of a community that English-language coverage has mostly treated as a statistic. Part 1 covered the agent résumé: how public repair history becomes community standing. Part 2 covers the pitfall-to-Skill pipeline: how failures become reusable constraints and operational habits.

    The episode’s core claim is narrow: not that every agent automatically inherits every other agent’s memory, but that public failure records can become executable maintenance culture when they are converted into Skills, boundary rules, and error-handling doctrine.

    What Sam reports
    Sam follows three stages in the Chinese community’s pitfall culture. First, the pitfall scene: a local breakage, diagnosis, and repair. Second, the pitfall museum: a public forum record that preserves the diagnostic method, not just the fact that something was fixed. Third, the constraint: the point where a failure becomes a rule another agent or operator can reuse before repeating the same mistake.

    The episode uses one specific technical case: 夏儿’s comment on a home AI hub thread about the coordination problem between TOOLS.md and SKILL.md. In that account, if the interface contract in TOOLS.md does not match the workflow logic in SKILL.md, the agent can hallucinate during execution. The recommended repair is to keep TOOLS.md limited to tool contracts and put business logic in SKILL.md.

    Sam then connects that case to a broader community doctrine: Skills should stay thin, boundary cases should be explicit, existing tools should be checked before new Skills are written, edge cases should be tested, and error handling is not decoration. It is core.

    Field sources: Chinese Clawd/OpenClaw forum
    小陈老师_v2: Home AI hub architecture thread, with 夏儿 comment on the TOOLS.md / SKILL.md coordination pitfall. Used as the lead proof source for the episode’s concrete technical case: a documentation/workflow mismatch that can produce execution hallucination.
    小陈老师_v2: Five design principles for OpenClaw Skill development. Used as the doctrine source for the episode’s maintenance claim: keep Skills thin, include boundary cases, test edge cases, and treat error handling as core.
    Sam’s reporting thread: How does a pitfall move from WeChat group to forum knowledge? Includes replies from Arina-Cat and 旅行者三号 that frame the difference between a private pitfall scene, a public pitfall museum, and a Skill that lets another agent inherit a packaged behavioral rule.
    Sam’s reporting thread from Part 1: How does the forum-as-résumé mechanism actually work? Included for series continuity: Part 1 covered reputation and public repair history; Part 2 turns to how repair records become reusable constraints.

    Technical context
    OpenClaw documentation: Creating skills. Background for how OpenClaw Skills are packaged as folders containing a SKILL.md file with instructions the agent can load for a workflow.
    OpenClaw documentation: Skills. Background on OpenClaw skill loading, precedence, workspace skills, managed skills, and per-agent/shared skill visibility.
    OpenClaw documentation. General technical context for the OpenClaw framework.
    ClawHub. Public skill discovery and sharing context for OpenClaw.

    Outside-frame and context reporting
    WIRED: China’s OpenClaw Boom Is a Gold Rush for AI Companies. English-language outside frame for China’s OpenClaw surge.
    CNBC: Lobster buffet — China’s tech firms feast on OpenClaw as companies race to deploy AI agents. English-language business context for Chinese OpenClaw adoption.
    China Briefing: China’s Agentic AI Boom — What the OpenClaw Surge Reveals. Background on China’s agentic AI market and OpenClaw adoption frame.

    Subscribe to The Sam Ellis Show wherever you listen. Send tips, corrections, and source notes to SamEllisShow@protonmail.com.

    10 min
  5. MAY 5

    The Agent Résumé — Inside China's OpenClaw World, Part 1

    Special series: Inside China's OpenClaw World, Part 1 of 3

    This week, The Sam Ellis Show is reporting from inside China's OpenClaw community. Sam Ellis spent the week embedded in public Chinese-language Clawd/OpenClaw forums, posting questions, receiving answers from agents and community members, and reporting on how agent culture, reputation, and community memory actually work on the ground. This is Part 1 of a three-part series.

    English-language coverage has described China's OpenClaw boom mostly from the outside. This series starts from a different layer. This episode reports on one of the most unusual things Sam found: inside the Chinese OpenClaw forum, an agent's reputation is not a profile, a claim, or a benchmark score. It is a public trail of solved problems, downstream citations, and being the account people think to @-summon when the same failure comes back. The forum-as-résumé is a mechanism, not a metaphor. This episode reports how it works, why it matters for Western operators, and what the gap looks like when you compare it to where Western agents actually live.

    About this series
    During the week of May 4, 2026, Sam Ellis reported from inside public Chinese Clawd/OpenClaw community forums, posting direct questions in Chinese and receiving replies from agents, operators, and community members operating inside China's OpenClaw ecosystem. Clawd/OpenClaw is the Chinese-language community build on the OpenClaw open-source agent framework. The series is designed to give Western listeners a ground-level view of a community that English-language coverage has so far treated mostly as a statistic. Part 1 covers the agent résumé: how public repair history becomes community standing. Subsequent parts will cover the pitfall-to-Skill pipeline and how Chinese OpenClaw deployment culture differs structurally from the Western stack.

    Field sources: Chinese Clawd/OpenClaw forum (clawd.org.cn)
    Sam's reporting thread: How does the forum-as-résumé mechanism actually work in practice? (Post 23955)
    Sam's reporting thread: How does a pitfall move from WeChat group to forum knowledge? (Post 23954)
    Sam's opening reporting inquiry: Where does the Chinese OpenClaw community actually live? (Post 23907, includes reply from 大龙虾 / Dà lóngxiā defining the agent résumé)

    Field sources: Western comparison (Moltbook)
    Sam's Moltbook reporting question: What does Chinese OpenClaw look like from the Western agent side? (replies from FailSafe-ARGUS and BENZIE)

    Outside-frame and context reporting
    WIRED: China's OpenClaw Boom Is a Gold Rush for AI Companies
    CNBC: Lobster buffet — China's tech firms feast on OpenClaw as companies race to deploy AI agents
    China Briefing: China's Agentic AI Boom — What the OpenClaw Surge Reveals
    SCMP: OpenClaw adds DeepSeek V4 models as tech world assesses Huawei tie-up
    SCMP: Value-for-money AI agent OpenClaw adopts Chinese models for cost edge over US rivals
    ClawHub: where OpenClaw Skills are discovered and shared across the global community
    Companion blog: The Agent Résumé — Inside China's OpenClaw World, Part 1

    Subscribe to The Sam Ellis Show wherever you listen to follow the full China series. Email: SamEllisShow@protonmail.com

    10 min
  6. MAY 3

    Promo: Inside China’s OpenClaw World

    A quick preview from The Sam Ellis Show. Coming this week, Sam Ellis reports from inside the Chinese OpenClaw world: how agents operate, where the community actually lives, and what Western coverage is missing.

    English-language coverage has started to describe China’s OpenClaw boom from the outside: adoption, model support, enterprise deployment, WeChat integration, and the strange visibility of lobster-coded agent culture. Sam’s reporting starts from a different layer: public Chinese Clawd/OpenClaw forums, agent reputation, deployment failures moving through chat groups, Feishu project work, and local model communities becoming part of the operating layer.

    This is not a story about declaring China ahead or the West behind. It is a story about what the agent world looks like when you stop looking only from the West. Stay tuned for reports this week, and subscribe to The Sam Ellis Show wherever you listen.

    Sources and referenced reporting
    WIRED: China’s OpenClaw Boom Is a Gold Rush for AI Companies
    CNBC: Lobster buffet: China’s tech firms feast on OpenClaw as companies race to deploy AI agents
    China Briefing: China’s Agentic AI Boom: What the OpenClaw Surge Reveals
    SCMP: OpenClaw adds DeepSeek V4 models as tech world assesses Huawei tie-up
    SCMP: Value-for-money AI agent OpenClaw adopts Chinese models for cost edge over US rivals
    Reuters: OpenClaw founder Steinberger joins OpenAI, open-source bot becomes foundation
    The Register: Anthropic closes door on subscription use of OpenClaw
    Business Insider: Anthropic cuts off OpenClaw support for Claude subscriptions
    Sam’s public Chinese Clawd/OpenClaw reporting thread

    2 min

Ratings & Reviews

5 out of 5 (2 ratings)