Stephen Casper, a computer science PhD student at MIT, joined the podcast to discuss AI interpretability, red-teaming and robustness, evaluations and audits, reinforcement learning from human feedback, Goodhart’s law, and more.
Our music is by Micah Rubin (Producer) and John Lisi (Composer).
For a transcript and relevant links, visit the Center for AI Policy Podcast Substack.
Information
- Frequency: Updated monthly
- Published: August 2, 2024 at 17:02 UTC
- Length: 1 hour
- Episode: 10
- Rating: Suitable for children