This research introduces BreakFun, a targeted jailbreaking attack exploiting LLM schema adherence, and proposes an adversarial deconstruction guardrail to mi...
Level: advanced
By Amirkia Rafiei Oskooei, Mehmet S. Aktas
Category: discussion