BaNEL introduces a novel approach to generative modeling that leverages negative rewards to train effectively in sparse-reward environments, offering a scala...
Level: advanced
By Unknown
Category: research