This research introduces scalable policy-based RL algorithms for POMDPs, addressing computational hurdles in continuous belief states through finite-state ap...
Level: advanced
By Unknown
Category: research