Týr-the-Pruner: Structural Pruning LLMs via Global Sparsity Distribution Optimization

Explore Týr-the-Pruner, an advanced framework for optimizing Large Language Model efficiency through global structural pruning and sparsity distribution alig...

Level: advanced

By Guanchen Li and 6 other authors

Category: research