This research establishes a scaling-law framework to quantify how jailbreak success in LLMs correlates with attacker computational effort, revealing critical...
Level: advanced
By Xiangwen Wang
Category: discussion