Stephen Casper, a computer science PhD student at MIT, joined the podcast to discuss AI interpretability, red-teaming and robustness, evaluations and audits, reinforcement learning from human feedback, Goodhart’s law, and more.
Our music is by Micah Rubin (Producer) and John Lisi (Composer).
For a transcript and relevant links, visit the Center for AI Policy Podcast Substack.
Information
- Frequency: Updated monthly
- Published: August 2, 2024, 5:02 PM UTC
- Length: 1 hour
- Episode: 10
- Rating: All ages