Explore TrailBlazer, a novel algorithm for sample-efficient Monte-Carlo planning that optimizes decision-making in Markov decision processes by focusing on n...
Level: advanced
By Jean-Bastien Grill, Michal Valko, Rémi Munos
Category: research