CoScale-RL: Efficient Post-Training by Co-Scaling Data and Computation

Explore CoScale-RL, a novel post-training framework that enhances Large Reasoning Models by co-scaling data and computation without extensive supervised fine...

Level: advanced

By Unknown

Category: research