HyperAI

Atari Games On Atari 2600 Pong

Metrics

Score

Results

Performance results of various models on this benchmark

Comparison table
Model name                                     Score
prioritized-experience-replay                  18.9
evolution-strategies-as-a-scalable             21.0
evolving-simple-programs-for-playing-atari     20
noisy-networks-for-exploration                 21
discrete-latent-space-world-models-for         20.2
recurrent-rational-networks                    18.13
mean-actor-critic                              10.6
dueling-network-architectures-for-deep         20.9
mastering-atari-with-discrete-world-models-1   20
a-distributional-perspective-on-reinforcement  20.9
dna-proximal-policy-optimization-with-a-dual   19.7
playing-atari-with-deep-reinforcement          21
human-level-control-through-deep               18.9
massively-parallel-methods-for-deep            16.7
online-and-offline-reinforcement-learning-by   20.95
curl-contrastive-unsupervised-representations  2.1
deep-reinforcement-learning-with-double-q      19.5
increasing-the-action-gap-new-operators-for    19.66
distributed-prioritized-experience-replay      20.9
generalized-data-distribution-iteration        21
asynchronous-methods-for-deep-reinforcement    11.4
asynchronous-methods-for-deep-reinforcement    10.7
train-a-real-world-local-path-planner-in-one   21
Model 24                                       -17.4
impala-scalable-distributed-deep-rl-with       20.98
increasing-the-action-gap-new-operators-for    19.76
asynchronous-methods-for-deep-reinforcement    5.6
generalized-data-distribution-iteration        21.0
recurrent-rational-networks                    18.04
the-arcade-learning-environment-an-evaluation  -19
implicit-quantile-networks-for-distributional  21
deep-reinforcement-learning-with-double-q      18.0
distributed-deep-reinforcement-learning-learn  20
agent57-outperforming-the-atari-human          20.67
recurrent-experience-replay-in-distributed     21.0
generalized-data-distribution-iteration        21
dueling-network-architectures-for-deep         21.0
self-imitation-learning                        20.9
dueling-network-architectures-for-deep         18.8
dueling-network-architectures-for-deep         20.9
deep-exploration-via-bootstrapped-dqn          20.9
soft-actor-critic-for-discrete-action          -20.98
prioritized-experience-replay                  20.6
deep-reinforcement-learning-with-double-q      18.4
generalized-data-distribution-iteration        21.0
deep-reinforcement-learning-with-double-q      19.1
policy-optimization-with-penalized-point       20.5
distributional-reinforcement-learning-with-1   21
the-arcade-learning-environment-an-evaluation  21
mastering-atari-go-chess-and-shogi-by          21.00
learning-values-across-many-orders-of          20.6
decision-transformer-reinforcement-learning    17.1
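
The table is listed in no particular order, so a reader who wants a ranking has to sort it. A minimal Python sketch of that step follows, using a handful of rows copied from the comparison table; the `rows` list and the `rank_by_score` helper are illustrative names, not part of the HyperAI page or any library.

```python
# A few (model name, score) rows copied from the comparison table.
# The selection is arbitrary and only serves as sample input.
rows = [
    ("prioritized-experience-replay", 18.9),
    ("mean-actor-critic", 10.6),
    ("impala-scalable-distributed-deep-rl-with", 20.98),
    ("self-imitation-learning", 20.9),
    ("decision-transformer-reinforcement-learning", 17.1),
]


def rank_by_score(entries):
    """Return the entries sorted from highest to lowest score (hypothetical helper)."""
    return sorted(entries, key=lambda entry: entry[1], reverse=True)


ranked = rank_by_score(rows)
for name, score in ranked:
    # Left-align the name and right-align the score for a readable listing.
    print(f"{name:<48}{score:>7.2f}")
```

The same helper works on the full table once every row is entered as a (name, score) pair. Because several entries share a model name, the sort key is the score alone.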