LLMs Do Not Grade Essays Like Humans

This research investigates the misalignment between Large Language Models and human raters in automated essay scoring, revealing distinct biases in how model...

Level: advanced

By Jerin George Mathew, Sumayya Taher, Anindita Kundu, Denilson Barbosa

Category: research