This research introduces a data-driven framework for selecting LLMs based on real-time energy and performance telemetry, optimizing decision latency and oper...
Level: advanced
By Daria Smirnova, Hamid Nasiri, Marta Adamska, Zhengxin Yu, Peter Garraghan
Category: research