This research introduces Weight Patching, a novel parameter-space intervention method that localizes LLM behavior to specific internal components by replacin...
Level: advanced
By Chenghao Sun
Category: research