This research introduces APreQEL, an adaptive mixed precision quantization mechanism designed to optimize Large Language Model deployment on resource-constra...
Level: advanced
By Meriem Bouzouad
Category: research