GazeQwen: Lightweight Gaze-Conditioned LLM Modulation for Streaming Video Understanding

This research introduces GazeQwen, a novel architecture that enhances multimodal LLMs by integrating gaze awareness through lightweight hidden-state modulati...

Level: advanced

By Trong Thang Pham

Category: research