AngelSlim: A more accessible, comprehensive, and efficient toolkit for large model compression

AngelSlim introduces a training-aligned speculative decoding framework that achieves significant throughput gains through dynamic sparsity pruning and modali...

Level: advanced

By Rui Cen and 12 other authors

Category: research