Explore advanced strategies for scaling Reinforcement Learning training in Large Language Models, focusing on entropy regularization and exploration metrics ...
Level: advanced
By Machine Learning Department, Carnegie Mellon University
Category: research