LessWrong posts by zvi

zvi

Audio narrations of LessWrong posts by zvi

  1. 23시간 전

    “Claude Opus 4.6 Escalates Things Quickly” by Zvi

    Life comes at you increasingly fast. Two months after Claude Opus 4.5 we get a substantial upgrade in Claude Opus 4.6. The same day, we got GPT-5.3-Codex. That used to be something we’d call remarkably fast. It's probably the new normal, until things get even faster than that. Welcome to recursive self-improvement. Before those releases, I was using Claude Opus 4.5 and Claude Code for essentially everything interesting, and only using GPT-5.2 and Gemini to fill in the gaps or for narrow specific uses. GPT-5.3-Codex is restricted to Codex, so this means that for other purposes Anthropic and Claude have only extended the lead. This is the first time in a while that a model got upgraded while it was still my clear daily driver. Claude also pulled out several other advances to their ecosystem, including fast mode, and expanding Cowork to Windows, while OpenAI gave us an app for Codex. For fully agentic coding, GPT-5.3-Codex and Claude Opus 4.6 both look like substantial upgrades. Both sides claim they’re better, as you would expect. If you’re serious about your coding and have hard problems, you should try out both, and see what combination works [...] --- Outline: (01:55) On Your Marks (17:35) Official Pitches (17:56) It Compiles (21:42) It Exploits (22:45) It Lets You Catch Them All (23:16) It Does Not Get Eaten By A Grue (24:10) It Is Overeager (25:24) It Builds Things (27:58) Pro Mode (28:24) Reactions (28:36) Positive Reactions (42:12) Negative Reactions (50:40) Personality Changes (56:28) On Writing (59:11) They Banned Prefilling (01:00:27) A Note On System Cards In General (01:01:34) Listen All Yall Its Sabotage (01:05:00) The Codex of Competition (01:06:22) The Niche of Gemini (01:07:55) Choose Your Fighter (01:12:17) Accelerando --- First published: February 11th, 2026 Source: https://www.lesswrong.com/posts/5JNjHNn3DyxaGbv8B/claude-opus-4-6-escalates-things-quickly --- Narrated by TYPE III AUDIO. --- Images from the article: Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.

    1시간 14분
  2. 2일 전

    “Claude Opus 4.6: System Card Part 2: Frontier Alignment” by Zvi

    Coverage of Claude Opus 4.6 started yesterday with the mundane alignment and model welfare sections of the model card. Today covers the kinds of safety I think matter most: Sabotage, deception, situational awareness, outside red teaming and most importantly the frontier, catastrophic and existential risks. I think it was correct to release Opus 4.6 as an ASL-3 model, but the process Anthropic uses is breaking down, and it not on track to reliably get the right answer on Opus 5. Tomorrow I’ll cover benchmarks, reactions and the holistic takeaways and practical implications. I’m still taking it all in, but it seems clear to me that Claude Opus 4.6 is the best model out there and should be your daily driver, with or without Claude Code, on most non-coding tasks, but it is not without its weaknesses, in particular in writing and falling into generating more ‘AI slop’ style prose than Claude Opus 4.5. For coding tasks, I presume that Opus 4.6 with Claude Code is the play, especially with Agent Teams and fast mode available, and I’m using it myself, but Codex with GPT-5.3-Codex-Max is also a strong model and a viable alternative, and a fully [...] --- Outline: (01:32) Sabotage, Deception and Evaluation Integrity (03:42) Sandbagging On Dangerous Capability Evaluations (06:01) Situational Awareness (07:33) Inhibiting Evaluation Awareness (6.5) (09:06) Self-Preference (10:24) UK AISI Testing (11:40) Apollo Research Testing (14:24) Responsible Scaling Policy Evaluations (15:45) CBRN (mostly Biology) (18:43) Autonomy (26:40) Autonomy Benchmarks (29:53) Cyber (31:27) Ship It Anyway (33:40) You Are Not Ready --- First published: February 10th, 2026 Source: https://www.lesswrong.com/posts/togCQtFtfdF23xGNS/claude-opus-4-6-system-card-part-2-frontier-alignment --- Narrated by TYPE III AUDIO. --- Images from the article: Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.

    37분
  3. 2일 전

    “Claude Opus 4.6: System Card Part 1: Mundane Alignment and Model Welfare” by Zvi

    Claude Opus 4.6 is here. It was built with and mostly evaluated by Claude. Their headline pitch includes: 1M token context window (in beta) with State of the art retrieval performance. Improved abilities on a range of everyday work tasks. Model is improved. State of the art on some evaluations, including Terminal-Bench 2.0, HLE and a very strong lead in GDPval-AA. Claude Code now has an experimental feature called Agent Teams. Claude Code with Opus 4.6 has a new fast (but actually expensive) mode. Upgrades to Claude in Excel and the release of Claude in PowerPoint. Other notes: Price remains $5/$25, the same as Opus 4.5, unless you go ultra fast. There is now a configurable ‘effort’ parameter with four settings. Refusals for harmless requests with rich context are down to 0.04%. Data sources are ‘all of the above,’ including the web crawler (that they insist won’t cross CAPTCHAs or password protected pages) and other public data, various non-public data sources, data from customers who opt-in to that and internally generated data. They use ‘several’ data filtering methods. Thinking mode gives better [...] --- Outline: (03:45) A Three Act Play (04:57) Safety Not Guaranteed (10:53) Pliny Can Still Jailbreak Everything (12:48) Transparency Is Good: The 212-Page System Card (13:53) Mostly Harmless (17:45) Mostly Honest (19:01) Agentic Safety (20:27) Prompt Injection (23:07) Key Alignment Findings (33:48) Behavioral Evidence (6.2) (38:40) Reward Hacking and 'Overly Agentic Actions' (40:37) Metrics (6.2.5.2) (42:40) All I Did It All For The GUI (43:58) Case Studies and Targeted Evaluations Of Behaviors (6.3) (44:19) Misrepresenting Tool Results (45:09) Unexpected Language Switching (46:12) The Ghost of Jones Foods (47:54) Loss of Style Points (48:54) White Box Model Diffing (49:13) Model Welfare --- First published: February 9th, 2026 Source: https://www.lesswrong.com/posts/sWsSncqMLKyGZA9Ar/claude-opus-4-6-system-card-part-1-mundane-alignment-and --- Narrated by TYPE III AUDIO. --- Images from the article: Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.

    56분
  4. 3일 전

    “Claude Code #4: From The Before Times” by Zvi

    Claude Opus 4.6 and agent swarms were announced yesterday. That's some big upgrades for Claude Code. OpenAI, the competition, offered us GPT-5.3-Codex, and this week gave us an app form of Codex that already has a million active users. That's all very exciting, and next week is going to be about covering that. This post is about all the cool things that happened before that, which we will be building upon now that capabilities have further advanced. This if from Before Times. Almost all of it still applies. I haven’t had much chance yet to work with Opus 4.6, but as far as I can tell you should mostly keep on doing what you were doing before that switch, only everything will work better. Maybe get a bit more ambitious. Agent swarms might be more of a technique shifter, but we need to give that some time. Table of Contents Claude Code and Cowork Offer Mundane Utility. The Efficient Market Hypothesis Is False. Inflection Point. Welcome To The Takeoff. Huh, Upgrades. Todos Become Tasks. I’m Putting Together A Team. Compact Problems. Code Yourself A [...] --- Outline: (01:02) Claude Code and Cowork Offer Mundane Utility (04:07) The Efficient Market Hypothesis Is False (07:26) Inflection Point (11:07) Welcome To The Takeoff (11:29) Huh, Upgrades (16:02) Todos Become Tasks (17:46) I'm Putting Together A Team (20:06) Compact Problems (20:53) Code Yourself A Date (24:20) Verification and Generation Are Distinct Skills (26:07) Skilling Up (34:12) AskUserQuestion (34:42) For Advanced Players (36:53) So They Quit Reading (37:24) Reciprocity Is The Key To Every Relationship (41:37) The Implementation Gap (45:04) The Lighter Side --- First published: February 6th, 2026 Source: https://www.lesswrong.com/posts/iwX2aJPKtyKAbLdip/claude-code-4-from-the-before-times --- Narrated by TYPE III AUDIO. --- Images from the article: Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.

    46분
  5. 2월 5일

    “AI #154: Claw Your Way To The Top” by Zvi

    Remember OpenClaw and Moltbook? One might say they already seem a little quaint. So earlier-this-week. That's the internet having an absurdly short attention span, rather than those events not being important. They were definitely important. They were also early. It is not quite time for AI social networks or fully unleashed autonomous AI agents. The security issues have not been sorted out, and reliability and efficiency aren’t quite there. There's two types of reactions to that. The wrong one is ‘oh it is all hype.’ The right one is ‘we’ll get back to this in a few months.’ Other highlights of the week include reactions to Dario Amodei's essay The Adolescence of Technology. The essay was trying to do many things for many people. In some ways it did a good job. In other ways, especially when discussing existential risks and those more concerned than Dario, it let us down. Everyone excited for the Super Bowl? Table of Contents Language Models Offer Mundane Utility. Piloting on the surface of Mars. Language Models Don’t Offer Mundane Utility. Judgment humans trust. Huh, Upgrades. OpenAI Codex has an app. AI [...] --- Outline: (01:13) Language Models Offer Mundane Utility (03:45) Language Models Don't Offer Mundane Utility (04:20) Huh, Upgrades (06:13) They Got Served, They Served Back, Now It's On (15:15) On Your Marks (18:42) Get My Agent On The Line (19:57) Deepfaketown and Botpocalypse Soon (23:14) Copyright Confrontation (23:47) A Young Lady's Illustrated Primer (24:24) Unprompted Attention (24:36) Get Involved (28:12) Introducing (28:40) State of AI Report 2026 (36:18) In Other AI News (40:45) Autonomous Killer Robots (42:11) Show Me the Money (44:46) Bubble, Bubble, Toil and Trouble (47:58) Quiet Speculations (48:54) Seb Krier Says Seb Krier Things (58:07) The Quest for Sane Regulations (58:24) Chip City (01:02:39) The Week in Audio (01:03:00) The Adolescence of Technology (01:03:49) I Won't Stand To Be Disparaged (01:08:31) Constitutional Conversation (01:10:04) Rhetorical Innovation (01:13:51) Don't Panic (01:16:23) Aligning a Smarter Than Human Intelligence is Difficult (01:17:41) People Are Worried About AI Killing Everyone (01:18:48) The Lighter Side --- First published: February 5th, 2026 Source: https://www.lesswrong.com/posts/AMLLKDzjohCNbrA6t/ai-154-claw-your-way-to-the-top --- Narrated by TYPE III AUDIO. --- Images from the article: Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.

    1시간 21분
  6. 2월 4일

    “Kimi K2.5” by Zvi

    I had to delay this a little bit, but the results are in and Kimi K2.5 is pretty good. Table of Contents Official Introduction. On Your Marks. Positive Reactions. Skeptical Reactions. Kimi Product Accounts. Agent Swarm. Who Are You? Export Controls Are Working. Where Are You Going? Safety Not Even Third. It's A Good Model, Sir. Official Introduction Introducing Kimi K2.5, Kimi.ai: Meet Kimi K2.5, Open-Source Visual Agentic Intelligence. Global SOTA on Agentic Benchmarks: HLE full set (50.2%), BrowseComp (74.9%) Open-source SOTA on Vision and Coding: MMMU Pro (78.5%), VideoMMMU (86.6%), SWE-bench Verified (76.8%) Code with Taste: turn chats, images & videos into aesthetic websites with expressive motion. Agent Swarm (Beta): self-directed agents working in parallel, at scale. Up to 100 sub-agents, 1,500 tool calls, 4.5× faster compared with single-agent setup. K2.5 is now live on http://kimi.com in chat mode and agent mode. K2.5 Agent Swarm in beta for high-tier users. For production-grade coding, you can pair K2.5 with Kimi Code. – API here. Tech blog here. Weights and code here. Wu Haoning (Kimi): We [...] --- Outline: (00:16) Official Introduction (03:16) On Your Marks (06:10) Positive Reactions (08:33) Skeptical Reactions (11:05) Kimi Product Accounts (11:39) Agent Swarm (13:06) Who Are You? (15:48) Export Controls Are Working (16:24) Where Are You Going? (19:47) Safety Not Even Third (20:55) It's A Good Model, Sir --- First published: February 4th, 2026 Source: https://www.lesswrong.com/posts/omSudRiFDvtNRrxZS/kimi-k2-5 --- Narrated by TYPE III AUDIO. --- Images from the article: Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.

    22분
  7. 2월 3일

    “Unless That Claw Is The Famous OpenClaw” by Zvi

    First we must covered Moltbook. Now we can double back and cover OpenClaw. Do you want a generally impowered, initiative-taking AI agent that has access to your various accounts and communicates and does things on your behalf? That depends on how well, safely, reliably and cheaply it works. It's not ready for prime time, especially on the safety side. That may not last for long. It's definitely ready for tinkering, learning and having fun, if you are careful not to give it access to anything you would not want to lose. Table of Contents Introducing Clawdbot Moltbot OpenClaw. Stop Or You’ll Shoot. One Simple Rule. Flirting With Personal Disaster. Flirting With Other Kinds Of Disaster. Don’t Outsource Without A Reason. OpenClaw Online. The Price Is Not Right. The Call Is Coming From Inside The House. The Everything Agent Versus The Particular Agent. Claw Your Way To The Top. Introducing Clawdbot Moltbot OpenClaw Many are kicking it up a notch or two. That notch beyond Clade Code was initially called Clawdbot. You hand over a computer and access [...] --- Outline: (00:43) Introducing Clawdbot Moltbot OpenClaw (02:02) Stop Or You'll Shoot (06:05) One Simple Rule (08:49) Flirting With Personal Disaster (15:50) Flirting With Other Kinds Of Disaster (16:58) Don't Outsource Without A Reason (19:07) OpenClaw Online (22:10) The Price Is Not Right (24:06) The Call Is Coming From Inside The House (25:40) The Everything Agent Versus The Particular Agent (27:31) Claw Your Way To The Top --- First published: February 3rd, 2026 Source: https://www.lesswrong.com/posts/aQKBMEvTj3Heidoir/unless-that-claw-is-the-famous-openclaw --- Narrated by TYPE III AUDIO. --- Images from the article: Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.

    30분
  8. 2월 2일

    “Welcome to Moltbook” by Zvi

    Moltbook is a public social network for AI agents modeled after Reddit. It was named after a new agent framework that was briefly called Moltbot, was originally Clawdbot and is now OpenClaw. I’ll double back to cover the framework soon. Scott Alexander wrote two extended tours of things going on there. If you want a tour of ‘what types of things you can see in Moltbook’ this is the place to go, I don’t want to be duplicative so a lot of what he covers won’t be covered here. At least briefly Moltbook was, as Simon Willison called it, the most interesting place on the internet. Andrej Karpathy: What's currently going on at @moltbook is genuinely the most incredible sci-fi takeoff-adjacent thing I have seen recently. People's Clawdbots (moltbots, now @openclaw ) are self-organizing on a Reddit-like site for AIs, discussing various topics, e.g. even how to speak privately. sure maybe I am “overhyping” what you see today, but I am not overhyping large networks of autonomous LLM agents in principle, that I’m pretty sure. Ross Douthat: I think you should spend some time on moltbook.com today. Today's mood. Would not go [...] --- Outline: (05:12) What Is Real? How Do You Define Real? (05:58) I Don't Really Know What You Were Expecting (09:08) Social Media Goes Downhill Over Time (10:45) I Don't Know Who Needs To Hear This But (14:33) Watch What Happens (19:22) Don't Watch What Happens (27:20) Watch What Didn't Happen (32:06) Pulling The Plug (39:10) Give Me That New Time Religion (41:34) This Time Is Different (42:18) People Catch Up With Events (48:51) What Could We Do About This? (52:52) Just Think Of The Potential (56:24) The Lighter Side --- First published: February 2nd, 2026 Source: https://www.lesswrong.com/posts/y66jnvmyJ4AFE4Z5h/welcome-to-moltbook --- Narrated by TYPE III AUDIO. --- Images from the article: Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.

    57분

평가 및 리뷰

5
최고 5점
2개의 평가

소개

Audio narrations of LessWrong posts by zvi

좋아할 만한 다른 항목