Prior noop | 26357.8 | Prioritized Experience Replay | |
Recurrent Rational DQN Average | 7460 | Adaptive Rational Activations to Boost Deep Reinforcement Learning | |
Discrete Latent Space World Model (VQ-VAE) | 635 | Smaller World Models for Reinforcement Learning | - |
Bootstrapped DQN | 9083.1 | Deep Exploration via Bootstrapped DQN | |