Spectral Geometry of LoRA Adapters Encodes Training Objective and Predicts Harmful Compliance
This research explores how the spectral geometry of LoRA adapters reveals training intent and predicts harmful behavior in LLMs, while highlighting critical ...