Pseudo-Quantized Actor-Critic Algorithm for Robustness to Noisy Temporal Difference Error
This research introduces a novel pseudo-quantized actor-critic algorithm leveraging control as inference to mitigate instability from noisy temporal differen...