02/08/2024
TẬP 10
1 GIỜ

#10: Stephen Casper on Technical and Sociotechnical AI Safety Research

Stephen Casper, a computer science PhD student at MIT, joined the podcast to discuss AI interpretability, red-teaming and robustness, evaluations and audits, reinforcement learning from human feedback, Goodhart’s law, and more.

Our music is by Micah Rubin (Producer) and John Lisi (Composer).

For a transcript and relevant links, visit the Center for AI Policy Podcast Substack.

Trang web Tập phim

Chương trình

Center for AI Policy Podcast
Tần suất

Hằng tháng
Đã xuất bản

lúc 17:02 UTC 2 tháng 8, 2024
Thời lượng

1 giờ
Tập

10
Xếp hạng

Sạch

#10: Stephen Casper on Technical and Sociotechnical AI Safety Research

Thông Tin