This research investigates the misalignment between Large Language Models and human raters in automated essay scoring, revealing distinct biases in how model...
Level: advanced
By Jerin George Mathew, Sumayya Taher, Anindita Kundu, Denilson Barbosa
Category: research