Attention Gathers, MLPs Compose: A Causal Analysis of an Action-Outcome Circuit in VideoViT
This research dissects the internal mechanics of VideoViT, revealing how attention heads gather evidence while MLPs compose concepts to distinguish action ou...