From Local to Global: Revisiting Structured Pruning Paradigms for Large Language Models

Explore GISP, a novel global iterative structured pruning framework that optimizes Large Language Models through model-level loss aggregation, enabling effic...

Level: advanced

By Ziyan Wang and 8 other authors

Category: research