MATH-Beyond introduces a new benchmark designed to push reinforcement learning methods beyond the capabilities of standard base models, specifically targeting mathematical reaso...
Level: advanced
By Unknown
Category: research