#10: Stephen Casper on Technical and Sociotechnical AI Safety Research

Center for AI Policy Podcast

Stephen Casper, a computer science PhD student at MIT, joined the podcast to discuss AI interpretability, red-teaming and robustness, evaluations and audits, reinforcement learning from human feedback, Goodhart’s law, and more.

Our music is by Micah Rubin (Producer) and John Lisi (Composer).

For a transcript and relevant links, visit the Center for AI Policy Podcast Substack.

To listen to explicit episodes, sign in.

Stay up to date with this show

Sign in or sign up to follow shows, save episodes, and get the latest updates.

Select a country or region

Africa, Middle East, and India

Asia Pacific

Europe

Latin America and the Caribbean

The United States and Canada