EntroPIC: Towards Stable Long-Term Training of LLMs via Entropy Stabilization with Proportional-Integral Control

EntroPIC introduces a novel Proportional-Integral control framework to stabilize entropy during long-term LLM training, offering theoretical convergence guar...

Level: expert

By Kai Yang and 7 other authors

Category: research