AngelSlim: A more accessible, comprehensive, and efficient toolkit for large model compression
AngelSlim introduces a training-aligned speculative decoding framework that achieves significant throughput gains through dynamic sparsity pruning and modali...