Weight Patching: Toward Source-Level Mechanistic Localization in LLMs

This research introduces Weight Patching, a novel parameter-space intervention method that localizes LLM behavior to specific internal components by replacin...

Level: advanced

By Chenghao Sun

Category: research