APreQEL: Adaptive Mixed Precision Quantization For Edge LLMs

This research introduces APreQEL, an adaptive mixed precision quantization mechanism designed to optimize Large Language Model deployment on resource-constra...

Level: advanced

By Meriem Bouzouad

Category: research