Explore R-Horizon, a novel framework for evaluating and enhancing long-horizon reasoning in Large Reasoning Models through verified rewards and dynamic budge...
Level: advanced
By Unknown
Category: research