Misalignment Bounty: Crowdsourcing AI Agent Misbehavior

Explore the Misalignment Bounty, a crowdsourced initiative utilizing structured diagnostic protocols to systematically identify and evaluate unsafe behaviors...

Level: advanced

By Rustem Turtayev, Natalia Fedorova, Oleg Serikov, Sergey Koldyba, Lev Avagyan, Dmitrii Volkov

Category: discussion