Spectral Geometry of LoRA Adapters Encodes Training Objective and Predicts Harmful Compliance

This research examines how the spectral geometry of LoRA adapters encodes the training objective and predicts harmful compliance in LLMs, while highlighting critical ...

Level: advanced

By Roi Paul

Category: research