This research reveals how human evaluators' preexisting beliefs systematically skew their assessment of AI logical reasoning, highlighting a critical gap in ...
Level: advanced
By Xi Cun, Jifan Ren, Asha Huang, Siyu Li, Ruzhen Song
Category: research