[2510.10730] Provable Anytime Ensemble Sampling Algorithms in Nonlinear Contextual Bandits

This research introduces provable ensemble sampling algorithms for nonlinear contextual bandits, establishing rigorous regret bounds for GLM-ES and Neural-ES...

Level: expert

By Unknown

Category: research