Hail to the Thief: Exploring Attacks and Defenses in Decentralised GRPO
This research exposes critical vulnerabilities in decentralized Group Relative Policy Optimization (GRPO) through novel adversarial token injection attacks. ...
Level: advanced
By Nikolay Blagoev, Oguzhan Ersoy, Lydia Yiyu Chen