This research introduces 'Thinking Traps' in long chain-of-thought reasoning and presents Trap-Aware Adaptive Restart (TAAR), a zero-fine-tuning method to de...
Level: advanced
By Kang Chen, Fan Yu, Junjie Nian, Shihan Zhao, Zhuoka Feng, Zijun Yao, Heng Wang, Minshen Yu, Yixin Cao
Category: research