VOGUE: Guiding Exploration with Visual Uncertainty Improves Multimodal Reasoning

VOGUE uses visual uncertainty to guide exploration in multimodal reasoning, significantly boosting accuracy on complex b...

Level: advanced

By Unknown

Category: research