A Systematic Study of Compression Ordering for Large Language Models

This research systematically analyzes how the order of pruning, knowledge distillation, and quantization impacts Large Language Model performance, revealing ...

Level: advanced

By Shivansh Chhawri, Rahul Mahadik, Suparna Rooj

Category: research