Explore MapTab, a new benchmark testing how Multimodal Large Language Models handle complex route planning under strict constraints. This research reveals cr...
Level: advanced
By Unknown
Category: research