DAQ: Delta-Aware Quantization for Post-Training LLM Weight Compression

This research introduces Delta-Aware Quantization (DAQ), a novel data-free framework designed to preserve critical post-training behaviors in large language ...

Level: advanced

By Xiaoming Yu

Category: research