From Efficiency to Adaptivity: A Deeper Look at Adaptive Reasoning in Large Language Models
This research formalizes adaptive reasoning in LLMs as a control-augmented policy optimization problem, decoupling cognitive paradigms to enable fine-grained...