A Comprehensive Evaluation on Quantization Techniques for Large Language Models

This research evaluates advanced quantization techniques for large language models, focusing on pre-quantization strategies, rotation-based error mitigation,...

Level: advanced

By Unknown

Category: research