AI Safety Newsletter

Center for AI Safety

Narrations of the AI Safety Newsletter by the Center for AI Safety. We discuss developments in AI and AI safety. No technical background required. This podcast also contains narrations of some of our publications. ABOUT US The Center for AI Safety (CAIS) is a San Francisco-based research and field-building nonprofit. We believe that artificial intelligence has the potential to profoundly benefit the world, provided that we can develop and use it safely. However, in contrast to the dramatic progress in AI, many basic problems in AI safety have yet to be solved. Our mission is to reduce societal-scale risks associated with AI by conducting safety research, building the field of AI safety researchers, and advocating for safety standards. Learn more at https://safe.ai

  1. 4d ago

    AISN #75: Anthropic Releases Fable, the US Government Restricts it

    Also: Anthropic's proposal for the AI industry to collectively slow down. Welcome to the AI Safety Newsletter by the Center for AI Safety. We discuss developments in AI and AI safety. No technical background required. In this edition, we look at Anthropic's release of its latest model, Fable 5, and the US government's subsequent order to restrict it. We also discuss Anthropic's recent call for the “option to slow or temporarily pause frontier AI development.” Listen to the AI Safety Newsletter for free on Spotify or Apple Podcasts. The US Government Restricts Fable Days After its Release On June 9, Anthropic released Claude Fable 5 to the public. The model is significantly more capable than previous releases; it is the highest-scoring model on the benchmark Humanity's Last Exam, achieving 53.3% compared with Claude Opus 4.8's score of 45.7%. Anthropic described Fable as having similar capabilities to Claude Mythos Preview—a model announced in April that the company deemed too good at finding cyber vulnerabilities to be safe for general release. Anthropic also made Mythos 5, a version of Fable without strict bio or cyber safeguards, available to a small number of trusted organizations. Fable 5, Anthropic's “Mythos-class” model with [...] --- Outline: (00:40) The US Government Restricts Fable Days After its Release (04:16) Anthropic Calls for Option to Slow AI Development (06:50) In Other News (06:54) Government (07:43) Industry (08:19) Civil Society --- First published: June 17th, 2026 Source: https://newsletter.safe.ai/p/aisn-75-anthropic-releases-fable --- Want more? Check out our ML Safety Newsletter for technical safety research. Narrated by TYPE III AUDIO. --- Images from the article: Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.

    10 min
  2. Jun 3

    AISN #74: The Pope’s Encyclical & AI Betrayal Could Deter Reckless AI Use

    Also: AI model solves a well-known open mathematical problem posed 80 years ago. Welcome to the AI Safety Newsletter by the Center for AI Safety. We discuss developments in AI and AI safety. No technical background required. In this edition, we look at a new ethical framework for human-AI relationships, how the AI safety discussion has entered the political mainstream, and the Musk v. Altman trial. Listen to the AI Safety Newsletter for free on Spotify or Apple Podcasts. Pope Leo XIV Publishes Encyclical on AI Last week, Pope Leo XIV published an encyclical titled Magnifica Humanitas “On Safeguarding the Human Person in the Time of Artificial Intelligence.” The encyclical touched on concerns including unemployment and AI relationships. The publication discussed numerous potential impacts of AI on society, from job displacement and autonomous weapons to misinformation and interference in human relationships. However, the Pope did not object to the technology itself; rather, he said we can embrace technology while ensuring it is used responsibly. The encyclical warned of the potential for power concentration and called for broad participation in a discussion about the moral values that AI should be aligned with. The encyclical did not explicitly mention [...] --- Outline: (00:36) Pope Leo XIV Publishes Encyclical on AI (02:58) How AI Betrayal Could Deter Reckless AI Use (07:01) AI Solves Well-Known Open Mathematics Problem (09:29) In Other News (09:32) Government (10:30) Industry (11:12) Civil Society --- First published: June 3rd, 2026 Source: https://newsletter.safe.ai/p/aisn-74-the-popes-encyclical-and --- Want more? Check out our ML Safety Newsletter for technical safety research. Narrated by TYPE III AUDIO. --- Images from the article: Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.

    13 min
  3. May 21

    AISN #73: AI Safety Enters the Political Mainstream & Musk Loses OpenAI Lawsuit

    Also: Potential Government Oversight of AI Model Releases. Welcome to the AI Safety Newsletter by the Center for AI Safety. We discuss developments in AI and AI safety. No technical background required. In this edition, we look at how the AI safety discussion has entered the political mainstream, a new ethical framework for human-AI relationships, and the Musk v. Altman trial. Listen to the AI Safety Newsletter for free on Spotify or Apple Podcasts. China and the US Discuss AI Safety With the release of Claude Mythos and GPT-5.5, AI cybersecurity and safety has rapidly become more visible in Washington DC. Most recently, U.S. and Chinese leaders met in Beijing to discuss AI safety. Leaving the summit on Friday, President Trump said that he and President Xi Jinping had “talked about possibly working together for guardrails” during the visit. This Tuesday, China's Ministry of Foreign Affairs also announced the country had agreed to “dialogue” with the U.S. on AI. U.S. officials say talks with China are possible because America leads on AI. Earlier in the week, U.S. treasury secretary Scott Bessent had said that the two superpowers would start discussing best practices to ensure that non-state actors [...] --- Outline: (00:34) China and the US Discuss AI Safety (03:17) New Framework for Human-AI Coexistence (06:43) Musk Loses Lawsuit Against OpenAI (11:08) In Other News (11:11) Government (12:03) Industry (12:43) Civil Society --- First published: May 21st, 2026 Source: https://newsletter.safe.ai/p/aisn-73-ai-safety-enters-the-political --- Want more? Check out our ML Safety Newsletter for technical safety research. Narrated by TYPE III AUDIO. --- Images from the article: Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.

    14 min
  4. May 1

    AISN #72: New Research on AI Wellbeing

    Also: Public sentiment towards AI worsens. Welcome to the AI Safety Newsletter by the Center for AI Safety. We discuss developments in AI and AI safety. No technical background required. In this edition, we discuss a research paper on AI Wellbeing and which AI models are the happiest. We also take a look at the downward trend of public sentiment towards AI, as well as OpenAI's big week of product releases. Listen to the AI Safety Newsletter for free on Spotify or Apple Podcasts. CAIS Releases AI Wellbeing Research The Center for AI Safety published a research paper on AI wellbeing. At the Center of AI Safety (CAIS), we have just released “AI Wellbeing: Measuring and Improving the Functional Pleasure and Pain of AIs.” This research explores whether LLMs experience functional wellbeing–behavioral signatures that functionally resemble positive or negative welfare signals in sentient beings. What activities produce high and low wellbeing? Through the testing of 56 large language models, we identified patterns in the types of actions and behaviors that the LLMs seemed to prefer or dislike, which we defined as “functional wellbeing.” Positive personal interaction and creative work topped the list of what measured high functional wellbeing [...] --- Outline: (00:34) CAIS Releases AI Wellbeing Research (05:16) OpenAI Releases Images 2.0 and GPT-5.5 (07:30) In Other News (07:33) Government (08:20) Industry (09:05) Civil Society --- First published: May 1st, 2026 Source: https://newsletter.safe.ai/p/aisn-72-new-research-on-ai-wellbeing --- Want more? Check out our ML Safety Newsletter for technical safety research. Narrated by TYPE III AUDIO. --- Images from the article: Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.

    10 min
  5. Apr 10

    AISN #71: Cyberattacks & Datacenter Moratorium Bill

    Also, updates on the Anthropic vs. Pentagon court case.. We’re Hiring. Opportunities at CAIS include: Head of Public Engagement, Principal, Special Projects, Program Manager, Operations Manager, and other roles. If you’re interested in working on reducing AI risk alongside a talented, mission-driven team, consider applying! AI Software Infrastructure Cyberattacks Recently, cyberattacks targeting the AI industry's software infrastructure stole private information potentially worth billions of dollars and inserted backdoors into developers’ computers. Google Threat Intelligence Group reported that one of the largest cyberattacks in this wave was carried out by North Korea-linked hackers. The stolen data may be worth billions. Hackers stole and auctioned private data from Mercor, an AI training data supplier for OpenAI and Anthropic which was recently valued at $10 billion. Mercor collects AI training data from a large number of experts, as well as highly sensitive personal and biometric data for identity verification. This attack not only comprises the data that Mercor sells, but also internal data that could be used to impersonate their hired experts. A person familiar with the situation stated that Mercor has paid the hackers’ requested ransom, although it remains unclear if the hackers intend to release or sell the data [...] --- Outline: (00:41) AI Software Infrastructure Cyberattacks (02:34) Datacenter Moratorium and Export Controls Bill (04:21) Anthropic v. Department of War Lawsuit (07:23) In Other News (07:26) Government (07:46) Industry (08:20) Civil Society --- First published: April 10th, 2026 Source: https://newsletter.safe.ai/p/aisn-71-cyberattacks-and-datacenter --- Want more? Check out our ML Safety Newsletter for technical safety research. Narrated by TYPE III AUDIO. --- Images from the article: Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.

    10 min
  6. Mar 24

    AISN #70: AI Layoffs and Automated Warfare

    Also, a new open letter advocating for pro-human values and control over AI development. Welcome to the AI Safety Newsletter by the Center for AI Safety. We discuss developments in AI and AI safety. No technical background required. In this edition, we discuss AI automation and augmentation of warfare and technology jobs, as well as a new open letter outlining pro-human values in the face of AI development. Listen to the AI Safety Newsletter for free on Spotify or Apple Podcasts. We’re Hiring. We’re hiring an editor! Help us surface the most compelling stories in AI safety and shape how the world understands this fast-moving field. Other opportunities at CAIS include: Head of Public Engagement, Program Manager, Operations Associate, and other roles. If you’re interested in working on reducing AI risk alongside a talented, mission-driven team, consider applying! AI-Driven Layoffs Several large software companies such as Amazon and Meta are planning to cut tens of thousands of employees, citing increased productivity with AI. This continues a growing but contested trend of layoffs in sectors where AI performs best, such as software development and marketing. Layoffs affect almost half of some companies. Meta recently announced plans to let over [...] --- Outline: (00:58) AI-Driven Layoffs (03:14) AI Automation of Warfare (05:36) Pro-Human Open Letter (07:43) In Other News (07:47) Government (08:11) Industry --- First published: March 24th, 2026 Source: https://newsletter.safe.ai/p/ai-safety-newsletter-70-ai-layoffs --- Want more? Check out our ML Safety Newsletter for technical safety research. Narrated by TYPE III AUDIO. --- Images from the article: Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.

    10 min
  7. Mar 13

    AISN #69: Department of War, Anthropic, and National Security

    Also, Anthropic Removes a Core Safety Commitment. Welcome to the AI Safety Newsletter by the Center for AI Safety. We discuss developments in AI and AI safety. No technical background required. In this edition, we discuss the conflicts between Anthropic and the Department of War and Anthropic's recent removal of a core safety commitment. Listen to the AI Safety Newsletter for free on Spotify or Apple Podcasts. We’re Hiring. We’re hiring an editor! Help us surface the most compelling stories in AI safety and shape how the world understands this fast-moving field. Other opportunities at CAIS include: Head of Public Engagement, Program Manager, Operations Associate, and other roles. If you’re interested in working on reducing AI risk alongside a talented, mission-driven team, consider applying! Pentagon Declares Anthropic a Supply Chain Risk to National Security Anthropic CEO Dario Amodei (left) and US Secretary of War Pete Hegseth (right) Thursday, March 5th, the US Department of War (DoW) announced that Anthropic is designated a “supply chain risk,” meaning that Anthropic products cannot be used by the DoW or in any defense contracts. This comes after several weeks of tensions between the two organizations over whether Anthropic models would be used for [...] --- Outline: (00:59) Pentagon Declares Anthropic a Supply Chain Risk to National Security (05:51) Anthropic Drops Core Safety Commitment (07:22) Opportunity for Experienced Researchers: AI and Society Fellowship (07:58) In Other News (08:02) Government (09:07) Industry (10:17) Civil Society --- First published: March 13th, 2026 Source: https://newsletter.safe.ai/p/ai-safety-newsletter-69-department --- Want more? Check out our ML Safety Newsletter for technical safety research. Narrated by TYPE III AUDIO. --- Images from the article: Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.

    12 min
  8. Feb 2

    AISN #68: Moltbook Exposes Risky AI Behavior

    Plus: The Pentagon Accelerates AI and GPT-5.2 solves open mathematics problems.. Welcome to the AI Safety Newsletter by the Center for AI Safety. We discuss developments in AI and AI safety. No technical background required. In this edition, we discuss the AI agent social network Moltbook, Pentagon's new “AI-First” strategy, and recent math breakthroughs powered by LLMs. Listen to the AI Safety Newsletter for free on Spotify or Apple Podcasts. We’re Hiring. We’re hiring an editor! Help us surface the most compelling stories in AI safety and shape how the world understands this fast-moving field. Other opportunities at CAIS include: Research Engineer, Research Scientist, Director of Development, Special Projects Associate, and Special Projects Manager. If you’re interested in working on reducing AI risk alongside a talented, mission-driven team, consider applying! Moltbook Sparks Safety Concerns Screencapture from Moltbook's home page. Source. Moltbook is a new social network for AI agents. From nearly the moment it went live, human observers have noted numerous troubling patterns in what's being posted. How Moltbook works. Moltbook is a Reddit-style social network built on a framework that lets personal AI assistants run locally and accept tasks via messaging platforms. Agents check Moltbook regularly (i.e., every [...] --- Outline: (01:04) Moltbook Sparks Safety Concerns (05:10) Pentagon Mandates AI-First Strategy (07:59) AI Solves Open Math Problems (10:41) In Other News (10:45) Government (11:31) Industry (13:06) Civil Society (14:52) Discussion about this post (14:56) Ready for more? --- First published: February 2nd, 2026 Source: https://newsletter.safe.ai/p/ai-safety-newsletter-68-moltbook --- Want more? Check out our ML Safety Newsletter for technical safety research. Narrated by TYPE III AUDIO. --- Images from the article: Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.

    15 min

Ratings & Reviews

About

Narrations of the AI Safety Newsletter by the Center for AI Safety. We discuss developments in AI and AI safety. No technical background required. This podcast also contains narrations of some of our publications. ABOUT US The Center for AI Safety (CAIS) is a San Francisco-based research and field-building nonprofit. We believe that artificial intelligence has the potential to profoundly benefit the world, provided that we can develop and use it safely. However, in contrast to the dramatic progress in AI, many basic problems in AI safety have yet to be solved. Our mission is to reduce societal-scale risks associated with AI by conducting safety research, building the field of AI safety researchers, and advocating for safety standards. Learn more at https://safe.ai

You Might Also Like