This study reveals a critical disconnect between high scores on intrinsic LLM evaluations and actual user outcomes in nutrition, challenging current benchmarking standards ...
Level: advanced
By Karen Jia-Hui Li, Simone Balloccu, Ondrej Dusek, Ehud Reiter
Category: research