HyperAI

Atari Games On Atari 2600 Montezumas Revenge

Métriques

Score

Résultats

Résultats de performance de divers modèles sur ce benchmark

Nom du modèle
Score
Paper TitleRepository
R2D22061.3Recurrent Experience Replay in Distributed Reinforcement Learning-
GDI-H32500Generalized Data Distribution Iteration-
Best Learner10.7The Arcade Learning Environment: An Evaluation Platform for General Agents
MP-EB142Incentivizing Exploration In Reinforcement Learning With Deep Predictive Models
Sarsa-φ-EB2745.4Count-Based Exploration in Feature Space for Reinforcement Learning
DNA0DNA: Proximal Policy Optimization with a Dual Network Architecture
DQN+SR1778.8Count-Based Exploration with the Successor Representation
A3C FF (1 day) hs53Asynchronous Methods for Deep Reinforcement Learning
IMPALA (deep)0.00IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
Prior+Duel hs24.0Deep Reinforcement Learning with Double Q-learning
A3C FF hs67Asynchronous Methods for Deep Reinforcement Learning
GDI-I33000GDI: Rethinking What Makes Reinforcement Learning Different From Supervised Learning-
A2C+CoEX6635Contingency-Aware Exploration in Reinforcement Learning-
Advantage Learning0.42Increasing the Action Gap: New Operators for Reinforcement Learning
POP3D0Policy Optimization With Penalized Point Probability Distance: An Alternative To Proximal Policy Optimization
Nature DQN0Human level control through deep reinforcement learning
DDQN-PC3459Unifying Count-Based Exploration and Intrinsic Motivation
ASL DDQN0Train a Real-world Local Path Planner in One Hour via Partially Decoupled Reinforcement Learning and Vectorized Diversity-
Gorila84Massively Parallel Methods for Deep Reinforcement Learning
Sarsa-ε399.5Count-Based Exploration in Feature Space for Reinforcement Learning
0 of 50 row(s) selected.