This research establishes a theoretical mechanism for instability in deep learning training by analyzing fine-grained step size conditions and subspace dynam...
Level: expert
By Shenyang Deng, Boyao Liao, Zhuoli Ouyang, Tianyu Pang, Minhak Song, Yaoqing Yang
Category: research