SpecQuant: Spectral Decomposition and Adaptive Truncation for Ultra-Low-Bit LLMs Quantization

SpecQuant introduces a novel two-stage quantization framework leveraging spectral decomposition and Fourier-domain analysis to enable ultra-low-bit LLM deplo...

Level: advanced

By Zhixiong Zhao and 6 other authors

Category: research