Imagine in Space: Exploring the Frontier of Spatial Intelligence and Reasoning Efficiency in Vision Language Models

This research exposes critical inefficiencies in Vision Language Models regarding spatial reasoning and introduces the Imagery Driven Framework to optimize t...

Level: advanced

By Xiaoxing Lian, Aidong Yang, Jun Zhu, Peng Wang, Yue Zhang

Category: research