Hail to the Thief: Exploring Attacks and Defenses in Decentralised GRPO

This research exposes critical vulnerabilities in decentralized Group Relative Policy Optimization (GRPO) through novel adversarial token injection attacks. ...

Level: advanced

By Nikolay Blagoev, Oguzhan Ersoy, Lydia Yiyu Chen

Category: research