Distractor Injection Attacks on Large Reasoning Models: Characterization and Defense

This research investigates critical vulnerabilities in large reasoning models where adversarial distractions significantly reduce accuracy. It proposes a rob...

Level: advanced

By Unknown

Category: research