LessWrong (30+ Karma)

LessWrong

Audio narrations of LessWrong posts.

  1. 8h ago

    “China won’t win the AI race but would it be much worse if it did?” by Chastity Ruth

    It seems to me accepted wisdom in the West that the US owned labs must “beat” the Chinese labs in the race for AGI/ASI. Even those who don’t think there will be a winner, that essentially the race is to see which country's AI will kill/disempower us first, seem to believe that if there has to be a winner then better it be the US labs. (I haven't seen a survey, so I could be way off here.) For those in power and those heavily financially invested in the labs this of course makes sense. But what about for the rest of us? I had myself accepted the wisdom, in the nebulous way you consider propositions in domains where you don’t have any expertise. The CCP is repressive, I thought. They are an authoritarian country and therefore fundamentally a malign actor. Of course they mustn’t win. Then I heard Victor Shih, Chinese political system expert, talk on Dwarkesh's podcast: “For the Chinese government, they’re very afraid that some actor—outside, but even inside the Party—is going to use it [AI] as a tool to usurp the Party's power. So they want to know that they have a way of [...] --- Outline: (02:45) Who is more willing to slow AI development? (06:41) Who is more likely to hit the brakes before a rogue ASI breakout? (13:17) Who is developing models more responsibly? (17:07) Who is using models more responsibly? (20:30) Who is more likely to lead us to paradise? (21:45) Race dynamics --- First published: June 3rd, 2026 Source: https://www.lesswrong.com/posts/nmpzH6sLLtKsQhSPM/china-won-t-win-the-ai-race-but-would-it-be-much-worse-if-it --- Narrated by TYPE III AUDIO.

    24 min
  2. 17h ago

    “Claude Opus 4.8: Capabilities and Reactions” by Zvi

    You need a lot of data points to understand a new model, and what you have. Trying to gauge from a few benchmarks is misleading. But if you have dozens of them, from a variety of sources, and you put them together with the model card tests and the model welfare information, you can start to form a consistent pattern. Trying to gauge reactions requires volume and calibration, now more than ever, because people are definitively nuts, or at least draw global conclusions from local data. There will always be people saying that the new model is bad, or the service got bad, or that it got bad in a particular way it clearly got good. I definitely notice the people saying 4.8 is a terrible model, despite this being obviously not true. And others will say it's great, again regardless of the underlying value. But with the reaction threads and good calibration, you can pick out the patterns. The model welfare information helps a lot, too. You are dealing with a mind that has a bunch of characteristics that all make sense together. This helps you make that sense. Self-Portrait by Opus 4.8, rendered [...] --- Outline: (01:30) The Official Pitch (02:15) But Wait There's More (06:37) It's A Good Model, Sir (08:04) Official Benchmarks (Including System Card Section 8) (15:41) Other People's Benchmarks (21:34) Your Regularly Scheduled Jailbreak (22:45) Every.To Is Really Into Opus 4.8 (26:36) Miscellaneous Positive Reactions (28:45) Haters Gonna Hate (28:58) Just The Tasks, Ma'am (29:24) It's Greek To Me (29:52) Honesty (38:13) Sycophancy (41:24) In A Trenchcoat (42:53) Don't Let AIs Edit Your Writing (48:21) Some Say It Is Judgy (50:50) You Have Not Been A Good User (51:48) Laziness (52:46) Code (58:29) Wet Versus Dry (59:54) Intelligence (01:01:25) Silly Wabbits (01:02:30) A Model Welfare Addendum (01:06:27) Putting It All Together --- First published: June 2nd, 2026 Source: https://www.lesswrong.com/posts/AfLGv6u9eZNuFHb4c/claude-opus-4-8-capabilities-and-reactions --- Narrated by TYPE III AUDIO. --- Images from the article: Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.

    1h 7m
  3. 18h ago

    “My favorite depiction of utopia” by Caleb Biddulph

    For those who are trying to bring about a glorious transhuman utopia with the help of hopefully-aligned ASI, I think it's worth thinking explicitly about what utopia might actually look like and where it's likely to fall short. To that end, some have helpfully written depictions of utopian (or utopia-adjacent) worlds: The Adventure, Just another day in utopia, The Culture, The Gentle Seduction, The Gentle Romance, Machines of Loving Grace, Friendship is Optimal, Dath Ilan, The Maker of MIND, Failed Utopia #4-2. Unfortunately, the best utopian story I've ever read is also a massive spoiler, since it appears at the very end of a much longer story (see below for the title and author): Worth the Candle by Alexander Wales Inspired by this tweet[1] and with the original author's permission, I adapted the epilogue of that story so it can be enjoyed without 1.5 million words of context! What I love most about this depiction is its exploration of the inherent imperfection of utopia: even when you have literally unlimited power, flaws will remain, and some (many?) people will even prefer the pre-utopia world. The primary purpose of this adaptation is to recontextualize the epilogue so it's accessible and [...] The original text contained 1 footnote which was omitted from this narration. --- First published: June 3rd, 2026 Source: https://www.lesswrong.com/posts/to9cSGgD6nALByKjg/my-favorite-depiction-of-utopia --- Narrated by TYPE III AUDIO.

    58 min
  4. 1d ago

    “Why Even Experts Don’t Know What to Do About AI Risk” by Luc Brinkman, plex

    AI Safety veteran Holden Karnofsky thinks there's a 49% chance his actions are making things worse.[1] In 2025, Jesse Clifton even stepped down as the executive director of the Center on Long-Term risk because of similar reasons. Even top AI Safety strategists don’t know what will make things better, and what will make things worse. Why is it so hard to improve humanity's odds? And what can you do to choose your actions? 1) Hidden Failure Lets You Fail Without Knowing It In AI Safety, impact is hard to measure, and thus lack of impact is often invisible. We call this "hidden failure". With hidden failure, projects fail to have a positive impact but the people doing the project don’t realise it. To understand where hidden failure comes from, it's useful to understand reasons why projects fail in general. These reasons fall on a spectrum: Wrong problem: You're addressing something with little influence on x-risk. For example, researching AI fairness when the core risk is misalignment.Wrong solution: Your solution doesn't solve the problem, even when competently executed. E.g. interpretability research that's technically novel but isn’t actually helpful.Poor execution: Your problem-solution set could be impactful but you're not executing your [...] --- Outline: (00:49) 1) Hidden Failure Lets You Fail Without Knowing It (02:44) 2) Why impact is harder than profit (03:24) 3) The pre-paradigmatic challenge The original text contained 8 footnotes which were omitted from this narration. --- First published: June 2nd, 2026 Source: https://www.lesswrong.com/posts/tRRkj249gdDL4mued/why-even-experts-don-t-know-what-to-do-about-ai-risk --- Narrated by TYPE III AUDIO. --- Images from the article: Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.

    5 min
  5. 1d ago

    “Agent Foundations Reminds Me of Continental Philosophy” by IanWS

    Nevertheless, I shall take advantage of your kindness in assuming we agree that a science cannot be conditioned upon empiricism. — Jacques Lacan, “The Subversion of the Subject and the Dialectic of Desire in the Freudian Unconscious”[1] Freud developed the first modern theory of the unconscious. His writings on drives, dreams and the id were instrumental in developing modern practices of psychology and neuroscience. Modern researchers are unlikely to leverage concepts like the superego or the oedipal complex, because we have been able to further our understanding of the mind through empirical work, which does not support many of Freud's specific claims. Freud pushed us in the right direction, but he lacked an empirical foundation to make precise claims. Freudian psychoanalysis mapped well to our narrative claims about the mind. Particularly coming out of the prudishness of late 19th century Europe, it was intuitive to learn we all had unconscious drives that did not track with societal norms. Stefan Zweig, writing about the austere Austro-Hungarian norms of his youth, wrote: The more a woman was supposed to appear as a “lady,” the less her natural forms were permitted to be discernible; fundamentally, fashion, with this deliberate guiding principle of [...] The original text contained 2 footnotes which were omitted from this narration. --- First published: June 2nd, 2026 Source: https://www.lesswrong.com/posts/GykcHo2rfex3YHPuD/agent-foundations-reminds-me-of-continental-philosophy --- Narrated by TYPE III AUDIO. --- Images from the article: Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.

    11 min
  6. 1d ago

    “Announcing the ARC White-Box Estimation Challenge” by Jacob_Hilton

    ARC has teamed up with AIcrowd to launch the ARC White-Box Estimation Challenge, a contest to improve upon our estimation algorithms for random MLPs. The warm-up round begins this week, and later rounds will have a total prize pool of at least $100,000. We are very grateful to Sharada Mohanty, Sneha Nanavati, Dipam Chakraborty and everyone else at AIcrowd for working with us to host this contest, as well as to Paul Rosu for testing the contest and to Harshita Khera for operational support. Introduction to the Challenge Our challenge follows the same setup as our recent paper on wide random MLPs: we consider MLPs with weights , defined by where the activation function is , applied coordinatewise. To begin with, we are fixing the width and the number of hidden layers , but we expect to change this setup in future rounds.[1] Contestants must design an algorithm that takes in a set of weights and produces an estimate for the expected output Algorithms will be evaluated on MLPs with randomly-sampled Gaussian weights. The goal is to achieve as low mean squared error as possible, subject to certain computational [...] --- Outline: (00:41) Introduction to the Challenge (01:58) Why run this contest? (03:39) Use of LLMs The original text contained 4 footnotes which were omitted from this narration. --- First published: June 2nd, 2026 Source: https://www.lesswrong.com/posts/Kben8CzS4awCwNw5c/announcing-the-arc-white-box-estimation-challenge --- Narrated by TYPE III AUDIO. --- Images from the article: Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.

    5 min

About

Audio narrations of LessWrong posts.

You Might Also Like