This research systematically analyzes how the order of pruning, knowledge distillation, and quantization impacts Large Language Model performance, revealing ...
Level: advanced
By Shivansh Chhawri, Rahul Mahadik, Suparna Rooj
Category: research