How to Explore to Scale RL Training of LLMs on Hard Problems?

Explore advanced strategies for scaling Reinforcement Learning training in Large Language Models, focusing on entropy regularization and exploration metrics ...

Level: advanced

By Machine Learning Department, Carnegie Mellon University

Category: research