BaNEL: Exploration Posteriors for Generative Modeling Using Only Negative Rewards

BaNEL is an approach to generative modeling that trains using only negative rewards in sparse-reward environments, offering a scala...

Level: advanced

By Unknown

Category: research