8월 20일
시즌 1, 에피소드 2
57분

2 | AI Reliability and Humans Testing Language Models (Anastasios Angelopolous of LM Arena) - - 20-AUG-2025

How fast is AI really improving, and how do we know? What guarantees can we expect from AI systems to be robust and reliable? What is AGI and have we gotten there? Can AI systems show creativity or even sentience?

Join Anastasios Angelopoulos as he lays out his thoughts to these hard questions, as he and his partners build the world's most sophisticated ways to test LLMs as they get better faster than everyone expects.

Show Notes: Anastasios's Personal WebsiteConformal Prediction (Science of AI reliability)LM Arena (Humans testing LLMs)DeepSeek and DeepSeek R1

에피소드 웹페이지

프로그램

Variance
주기

매월 업데이트
발행일

2025년 8월 20일 오전 6:59 UTC
길이

57분
시즌

1
에피소드

2
등급

전체 연령 사용가

2 | AI Reliability and Humans Testing Language Models (Anastasios Angelopolous of LM Arena) - - 20-AUG-2025

정보