MLLM as a UI Judge: Benchmarking Multimodal LLMs for Predicting Human Perception of User Interfaces

This research benchmarks how well Multimodal Large Language Models (MLLMs) such as GPT-4o and Llama predict human perception of user interfaces, highlighting their potential to augment...

Level: advanced

By Unknown

Category: research