EntroPIC: Towards Stable Long-Term Training of LLMs via Entropy Stabilization with Proportional-Integral Control
EntroPIC introduces a novel Proportional-Integral control framework to stabilize entropy during long-term LLM training, offering theoretical convergence guar...