BreakFun: Jailbreaking LLMs via Schema Exploitation

This research introduces BreakFun, a targeted jailbreaking attack exploiting LLM schema adherence, and proposes an adversarial deconstruction guardrail to mi...

Level: advanced

By Amirkia Rafiei Oskooei, Mehmet S. Aktas

Category: discussion