An Improved Model-Free Decision-Estimation Coefficient with Applications in Adversarial MDPs
This research introduces Dig-DEC, a novel model-free decision estimation method that significantly reduces regret in adversarial Markov decision processes co...