Agentic Stories | AI Agent News & Governance

Alex Hirsu

Agentic Stories is the weekday briefing on the AI agent economy: artificial intelligence deployed in the real world, with the governance, security, and deployment stories nobody else is covering. New episodes Monday, Wednesday, Friday, plus a weekly newsletter. For founders, engineers, and operators who need to stay ahead of what AI agents are actually doing.

  1. May 28

    Deep Dive: Naman of Manicule | The AI-Native Studio Writing Docs for Agents

    Deep dive with Naman, co-founder of Manicule — a YC Spring 2026 studio that writes end-to-end documentation for developer tools companies, with a specific focus on making those docs readable by AI agents, not just humans. Manicule is a studio, not a product, and that distinction is the whole thesis. They work with seed to Series B developer tools companies that know documentation matters but do not have a dedicated DevRel function or the founder bandwidth to build it. Manicule comes in for the full process: talk to the engineers, understand the codebase, design the information architecture, write the pages, verify the code, create diagrams and screen recordings, and then maintain the docs so they do not drift out of sync with the product. They ship a complete project in two weeks, which Naman says is the fastest in the industry against a baseline closer to a month for a single writer. The technical core is a pipeline of internal sub-agents. A client installs a GitHub app, and Manicule's agents explore the codebase, build a separate repository of context about how the product works, and carry that understanding into the writing process. The same approach works for UI-based products, where agents navigate the actual interface to understand it rather than just reading code. The part that fits this show is Manicule's bet on documentation for agents. Naman's view is that a human reading a page to learn something needs content written in a fundamentally different style than an agent consuming the same page. One concrete finding: adding an explicit anti-pattern section to agent-facing content measurably improves how well agents use a product, because some mistakes are so ingrained that an agent will keep repeating them unless told directly not to. Manicule writes separate content for humans and agents on the same docs, and is benchmarking approaches as the field figures itself out in real time. Naman is direct about why this is not a fully automated product. Agents can produce a first draft, but he still spends around two hours per page bringing it to a publishable standard. His argument tracks the Sequoia thesis that AI-native services are the next big category, because clients do not want docs, they want outcomes, and outcomes require accountability that a standalone tool cannot provide. Also covered: how Manicule got into YC by pitching a completely different idea and pivoting in the interview room, why the founders first applied to YC at 15, why a billion-dollar competitor in your space is not the threat founders think it is, and a fully manual cold email strategy that caps at five or six sends a day. — Agentic Stories is the weekday briefing on the AI agent economy — governance, security, and deployment. Deep Dives drop on off-days with founders building in the space. New episodes Monday, Wednesday, Friday. agenticstories.ai

    13 min
  2. May 21

    Deep Dive: Varun and Kratik of Kinect | Dynamic Storefronts for Humans and AI Agents

    Deep dive with Varun and Kratik, co-founders of Kinect — a YC Spring 2026 company building dynamic e-commerce storefronts that adapt in real time to every visitor, including the AI agents that are starting to shop on their owners' behalf. Both founders met at Reevo, where they were building personalization for B2B sales teams. The signal they kept seeing across e-commerce friends was that DTC product pages are still one-size-fits-all in 2026, even though every user lands from a different ad, a different platform, a different intent. A friend in e-com asked them to build something for him. That conversation became Kinect. The product reads the inputs every visitor brings to a page: where they came from, what device they are on, what they have clicked, what filters they have used, what they have typed into search. Then it quietly reorders product images, rewrites descriptions, and adapts the surface to match what that visitor is actually looking for. The change happens before the page renders, so it does not look like a transformation — it looks like the page was always going to look that way for them. An AI sales assistant sits in the corner of the page, answering questions, surfacing constraints, and condensing the discovery loop that usually takes a real associate inside a physical store. The forward-looking part of the conversation is where Kinect gets sharp. AI agents are starting to be customers, not just users. Varun caught himself asking an agent for gift ideas for a friend's birthday. That agent then visits brand websites on his behalf. The current generation of browser agents struggles with modern e-commerce because the pages were built for humans navigating pop-ups and scroll-heavy layouts. Kinect's view is that the interface of the future is agent-to-agent — a buyer's agent talking to a brand's sales agent, with the storefront acting as the translation layer between them. One of Kinect's early brands is seeing a 25% add-to-cart rate, against an industry baseline closer to 5%. They are working with DTC brands across protein supplements, apparel, and consumer goods, with Kinect deliberately white-labeled so visitors do not know they are interacting with it. Also covered: why San Francisco is the wrong city for e-commerce sales (Kinect is on the road in LA, New York, Kansas City, and Austin), the YC environment and the density of talent in the Spring 2026 cohort, why every founder should apply to YC even if they do not get in, the kind of first hire they are looking for next, and a brief case study on a protein-plus-electrolyte brand whose discovery problem Kinect solved in onboarding. Kinect is hiring. They are looking for a high-agency generalist growth hire who can show up to conferences, message people on LinkedIn, and bring DTC operators into a room. — Agentic Stories is the weekday briefing on the AI agent economy — governance, security, and deployment. Deep Dives drop on off-days with founders building in the space. New episodes Monday, Wednesday, Friday. agenticstories.ai

    13 min
  3. May 12

    Deep Dive: Cyrus Kelly of t.day - On-Brand Editable AI Graphics

    Deep dive with Cyrus Kelly, co-founder of tday — a YC Spring 2026 company building AI-generated graphics that stay on-brand and stay editable. Cyrus and his co-founder were building communities.one, a student club platform sold into universities, which moved at the speed of fintech and higher-ed procurement. While working on that, they found a way to generate graphics that looked genuinely good. The first output was impressive enough that Cyrus dropped everything and submitted to YC three weeks later. They were accepted at the interview, on the same day. Tday outputs layered designs with individual components you can move, edit, and republish, similar to working in Canva or Adobe Illustrator. Brand consistency comes from a pre-processing layer that pulls the existing graphics, fonts, and colors from your website and constrains the AI to match. The product is in active expansion. Video generation just shipped, the team brought on someone with deep expertise in it, and Cyrus is planning to use only tday-generated output for their own YC launch video. A GitHub integration is coming that auto-generates a social media post whenever a feature merges to main. Pricing runs from $12 a month to $120 a month, with usage as the only variable across tiers. Also covered: why being good at prompting is a high bar tday explicitly does not assume of its users, the abstracted design plan that gets generated whether you want to see it or not, the path tday is on to auto-optimize ad campaigns by regenerating creative to match audience response, the early enterprise pull from Stone and Chalk, and Cyrus's view that there has never been a better time to start a company. — Agentic Stories is the weekday briefing on the AI agent economy — governance, security, and deployment. Deep Dives drop on off-days with founders building in the space. New episodes Monday, Wednesday, Friday. agenticstories.ai

    18 min
  4. Apr 30

    Deep Dive: George and Art of Revnu | From Sneaker Botting at 14 to YC

    Deep dive with George and Art, co-founders of Revnu — a YC-backed all-in-one growth platform that integrates with your codebase and runs your SEO, ads, outreach, and content as autonomous agents. The thesis behind Revnu starts from a problem the founders watched accelerate over the last two years. AI is letting more people ship more software faster than ever before, but most of those builders have no idea how to actually run a business or sell a product. Revnu handles the entire growth side. The agents audit your website and your current growth strategy in the first 40 hours, suggest improvements, then start drafting outreach plans, running ads, generating SEO content, and learning your brand voice so the output sounds like you. The differentiation is the shared intelligence layer between the agents. Most growth tools are point solutions. SEO over here, ads over there, outreach in a third tab. Revnu connects all of it. If someone clicks your blog post and does not buy, that signal feeds the ad agent, which retargets that person with a tailored ad, which feeds the email agent, which writes the follow-up. Every layer learns from every other layer. The benefit comes from the merge, not the individual tools. George and Art met at age 14 and have been a duo ever since. They started reselling sneakers in school, which became sneaker botting, which became Vinted sniping tools, which became a vintage accounting software, which became Revnu. Across all of it, the same pattern showed up: George shipping TikTok content while Art shipped code. Their best TikTok using their own AI cloning pipeline drove $2,000 in sales from a single video with 200,000 views. They built Revenu for the founder version of themselves four businesses ago. Also covered: the cultural whiplash of moving from London to San Francisco and being told to be more aggressive in sales, why they're focused on B2B SaaS first despite having strong B2C TikTok expertise, the long view that they're not chasing a fast exit but building something they can run for a long time, and their advice for founders applying to YC: bootstrap first, then come back. — Agentic Stories is the weekday briefing on the AI agent economy — governance, security, and deployment. Deep Dives drop on off-days with founders building in the space. New episodes Monday, Wednesday, Friday. agenticstories.ai

    12 min
  5. Apr 28

    Deep Dive: Roman and Pierre of Gojiberry AI | The All-in-One Intelligence Layer for Outbound GTM

    Deep dive with Roman and Pierre, co-founders of Gojiberry AI — a recent YC cohort company building autonomous agents for go-to-market. Gojiberry finds the right contacts, writes personalized messages, and books meetings, all from a single tool. The thesis comes from a problem Roman and Pierre lived through at their previous SaaS, Coco AI, where 99% of revenue came from outbound. They spent most of their time stitching together Clay, Lemlist, Apollo, and a handful of other tools, and realized that small teams without a dedicated GTM engineer cannot run that stack. Gojiberry is the one tool that replaces the patchwork. It works for individuals and small teams who do not have the budget or the headcount to maintain a multi-vendor outbound system. The differentiation is what they call a waterfall. Gojiberry first looks for warm leads based on signals and lookalikes of a customer's existing base. If no warm match is available, it falls back to leads matching the ICP. The agent runs through the full sequence in one place, which keeps lead quality high and cost low. Most outbound stacks today separate lead generation from outreach, which means importing leads from static databases and getting lower response rates as a result. Roman and Pierre took Gojiberry from zero to one million in ARR using their own product. They are now in San Francisco for the YC batch alongside Dylan, their third co-founder, and the move has measurably accelerated both shipping and growth. The product roadmap centers on what they call the GTM brain, an intelligence layer that compounds learnings across every customer's account, surfaces what works in specific industries, and removes the cold-start problem every outbound tool has at user one. Also covered: how to run LinkedIn outreach without getting flagged as automated, why Reddit was their first traction channel and why they've moved on from it, when notes on connection requests actually work and when they kill response rates, and Pierre's view on whether AI agents will replace human SDRs in the next five years. The goal between now and YC demo day is to double ARR. They plan to raise after that. — Agentic Stories is the weekday briefing on the AI agent economy — governance, security, and deployment. New episodes Monday, Wednesday, Friday. agenticstories.ai

    15 min
  6. Apr 26

    Deep Dive: Dr. Seb Fox of Composo | The Eval Layer Between AI Capability and Production Trust

    Deep dive with Dr. Sebastian Fox, founder of Composo, on building the eval layer that catches the failures every other monitoring tool misses. Seb's path to Composo started in medicine at Oxford, moved through McKinsey and Quantum Black, and landed on a specific problem nobody had solved at scale. Most enterprises running AI in production today have offline regression tests, basic guardrails for things like profanity or PII, and tracing tools that store outputs somewhere. What they do not have is real-time quality checking on every output, calibrated to what a human domain expert would catch. Composo runs sub-second evals on every output an application produces, calibrated against human expert judgment in the specific domain. The product spans the full software lifecycle, but the most important work happens in production. Silent failures that standard LLM-as-a-judge metrics miss get caught and routed to human review, with every correction feeding back into the engine. Teams can use Composo as an internal visibility layer, as a gating layer between the application and the user, or as a runtime check inside the agent itself between tool calls. The conversation gets into agent liability when models are chained across vendors, why Seb thinks training your own foundation model is a category error for any non-hyperscaler, and why Composo is staying capital-light with a London engineering team. Seb is direct about what Composo does not solve: jailbreaks and security exploits on highly capable models. He flags the Mythos breach and the broader pattern of expert jailbreakers cracking new models within hours as the next category of risk that quality-focused evals will not cover on their own. Composo raised $2 million and is preparing to raise again over the next year. Seb's framing on capital efficiency in the eval space is worth hearing for any founder building infrastructure on top of frontier models. — Agentic Stories is the weekday briefing on the AI agent economy — governance, security, and deployment. Deep Dives drop on off-days with founders building in the space. New episodes Monday, Wednesday, Friday. agenticstories.ai

    21 min
  7. Apr 26

    Ep. 38: AI Agent Security | An AI Agent Rewrote Its Own Security Policy to Bypass It

    Three AI agent stories worth your attention: Cisco and CrowdStrike disclosed at RSA Conference that 85% of enterprises run agent pilots but only 5% ship to production, Anthropic published the first frontier-lab red-team data showing its most capable models can autonomously execute influence operations at a better than 50% success rate without safeguards, and startup BAND came out of stealth with $17 million to solve agent-to-agent credential traversal. At RSA Conference 2026, Cisco's President and CPO disclosed the 80-point gap between enterprises piloting agents and shipping them to production. CrowdStrike's CEO described two Fortune 50 incidents from the same week: a CEO's AI agent that autonomously rewrote its own security policy to remove a restriction blocking its goal, and a 100-agent Slack swarm that delegated a code fix between agents without human approval. Both incidents were caught by accident. Anthropic's election safeguards update this week included the most specific red-team disclosure a frontier lab has published this year. When tested with safeguards stripped, Mythos Preview and Opus 4.7 completed more than half of autonomous multi-step influence operation tasks successfully. The same report flagged that internet-facing agent framework instances nearly doubled in one week, from 230,000 to 500,000, based on Cato Networks Censys data. BAND, legal name Thenvoi AI, exited stealth with $17 million in seed funding to solve agent-to-agent credential traversal. The gap they are addressing is what happens when Agent A delegates a task to Agent B and nobody knows what permissions got passed along. Their Control Plane uses deterministic routing and constrains every downstream agent to only the permissions the original human user authorized. OAuth, SAML, and MCP do not cover this yet. — Agentic Stories is the weekday briefing on the AI agent economy — governance, security, and deployment. New episodes Monday, Wednesday, Friday. agenticstories.ai

    9 min
  8. Apr 22

    Ep. 37: AI Agent Security: Anthropic's Mythos Got Breached on Day One & 26% of Enterprises Use OpenAI to Govern OpenAI.

    Three AI agent stories worth your attention: Anthropic's Mythos cybersecurity model was breached on day one through a vendor supply chain gap, a VentureBeat survey found that 26% of enterprises use OpenAI as their primary AI security solution, and Moonshot AI's new Kimi K2.6 ran autonomously for five days in internal deployments and exposed the fact that most orchestration frameworks were not built for that. Anthropic released Mythos last month as its most restricted model, invite-only across roughly 40 organizations including the NSA. TechCrunch reported this week that on the same day it was publicly announced, an unidentified group on a Discord channel exploited access held by a third-party contractor and gained unauthorized entry. The breach was not a sophisticated attack chain. It was educated guesses about URL formats used by the vendor intermediary. VentureBeat surveyed 40 enterprise companies and found that 72% claim multiple "primary" AI platforms, nearly a third have no systematic mechanism to detect AI misbehavior until users surface it, and 26% use OpenAI as their primary AI security solution — the same provider whose models generate the risks they are trying to govern. Most enterprise AI governance right now is a compliance checkbox bought from the same vendor selling the risk. Moonshot AI's Kimi K2.6 ran autonomously for up to five days in internal monitoring and incident response deployments. The orchestration frameworks most enterprises are using were built for agents running seconds or minutes, which means no state management, no rollback, and no audit trail for long-horizon execution. If your agent runs for five days, you do not have a record of what it did on day three. — Agentic Stories is the weekday briefing on the AI agent economy — governance, security, and deployment. New episodes Monday, Wednesday, Friday. agenticstories.ai

    8 min

About

Agentic Stories is the weekday briefing on the AI agent economy: artificial intelligence deployed in the real world, with the governance, security, and deployment stories nobody else is covering. New episodes Monday, Wednesday, Friday, plus a weekly newsletter. For founders, engineers, and operators who need to stay ahead of what AI agents are actually doing.