This research addresses instability in Large Reasoning Models by introducing StepFlow, a test-time intervention that repairs internal information flow failur...
Level: advanced
By Xiaoyu Xu
Category: research