Laminar: A Scalable Asynchronous RL Post-Training Framework

Laminar is a scalable asynchronous RL post-training framework designed to resolve GPU underutilization in large-scale training environments through t...

Level: advanced

By Unknown

Category: research