This research introduces GazeQwen, a novel architecture that enhances multimodal LLMs by integrating gaze awareness through lightweight hidden-state modulati...
Level: advanced
By Trong Thang Pham
Category: research