This episode introduces Petri (Parallel Exploration Tool for Risky Interactions), an open-source framework developed by Anthropic to accelerate AI safety research through automated auditing. Petri uses specialized AI auditor agents and LLM judges to test target models across diverse, multi-turn scenarios that human researchers define through seed instructions.
Information
- Show
- Published: October 13, 2025, 12:00 PM [UTC]
- Length: 14 minutes
- Age rating: Suitable for children and teens
