MLLM as a UI Judge: Benchmarking Multimodal LLMs for Predicting Human Perception of User Interfaces
This research evaluates how Multimodal Large Language Models like GPT-4o and Llama assess user interface perceptions, highlighting their potential to augment...