10月13日
14 分鐘

Petri: An Open-Source AI Safety Auditing Tool

This episode introduce Petri (Parallel Exploration Tool for Risky Interactions), an open-source framework developed by Anthropic to accelerate AI safety research through automated auditing. Petri uses specialized AI auditor agents and LLM judges to test target models across diverse, multi-turn scenarios defined by human researchers via seed instructions.

單集網頁

節目

Intelligence Unbound
發佈時間

2025年10月13日下午12:00 [UTC]
長度

14 分鐘
年齡分級

兒少適宜

Petri: An Open-Source AI Safety Auditing Tool

資訊