Blazing the trails before beating the path: Sample-efficient Monte-Carlo planning

Explore TrailBlazer, a novel algorithm for sample-efficient Monte-Carlo planning that optimizes decision-making in Markov decision processes by focusing on n...

Level: advanced

By Jean-Bastien Grill, Michal Valko, Rémi Munos

Category: research