Less Diverse, Less Safe

This research investigates how limiting diversity during test-time scaling in Large Language Models amplifies adversarial risks, introducing RefDiv as a crit...

Level: advanced

By Unknown

Category: discussion