Suspicious Alignment of SGD: A Fine-Grained Step Size Condition Analysis

This research establishes a theoretical mechanism for instability in deep learning training by analyzing fine-grained step size conditions and subspace dynam...

Level: expert

By Shenyang Deng, Boyao Liao, Zhuoli Ouyang, Tianyu Pang, Minhak Song, Yaoqing Yang

Category: research