Stephen Casper, a computer science PhD student at MIT, joined the podcast to discuss AI interpretability, red-teaming and robustness, evaluations and audits, reinforcement learning from human feedback, Goodhart’s law, and more.
Our music is by Micah Rubin (Producer) and John Lisi (Composer).
For a transcript and relevant links, visit the Center for AI Policy Podcast Substack.
資訊
- 節目
- 頻率每月更新
- 發佈時間2024年8月2日 下午5:02 [UTC]
- 長度1 小時
- 集數10
- 年齡分級兒少適宜