Explore Týr-the-Pruner, an advanced framework for optimizing Large Language Model efficiency through global structural pruning and sparsity distribution alig...
Level: advanced
By Guanchen Li and 6 other authors
Category: research