Systematic Scaling Analysis of Jailbreak Attacks in Large Language Models

This research establishes a scaling-law framework to quantify how jailbreak success in LLMs correlates with attacker computational effort, revealing critical...

Level: advanced

By Xiangwen Wang

Category: discussion