Robust AI Evaluation through Maximal Lotteries

This research introduces robust lotteries as a theoretically grounded alternative to traditional Bradley-Terry models, ensuring stable model evaluation under...

Level: expert

By Hadi Khalaf, Serena L. Wang, Daniel Halpern, Itai Shapira, Flavio du Pin Calmon, Ariel D. Procaccia

Category: research