HyperAI

Atari Games On Atari 2600 Double Dunk

المقاييس

Score

النتائج

نتائج أداء النماذج المختلفة على هذا المعيار القياسي

جدول المقارنة
اسم النموذجScore
distributional-reinforcement-learning-with-121.9
mastering-atari-go-chess-and-shogi-by23.94
self-imitation-learning21.5
dueling-network-architectures-for-deep0.1
massively-parallel-methods-for-deep-11.3
implicit-quantile-networks-for-distributional5.6
dueling-network-architectures-for-deep-12.5
deep-reinforcement-learning-with-double-q-0.3
impala-scalable-distributed-deep-rl-with-0.33
generalized-data-distribution-iteration24
prioritized-experience-replay18.5
mastering-atari-with-discrete-world-models-117
deep-reinforcement-learning-with-double-q-10.7
deep-reinforcement-learning-with-double-q-6.6
the-arcade-learning-environment-an-evaluation24
learning-values-across-many-orders-of-11.5
asynchronous-methods-for-deep-reinforcement0.1
human-level-control-through-deep-18.1
النموذج 19-16.0
dueling-network-architectures-for-deep-5.5
distributed-prioritized-experience-replay23.5
the-reactor-a-fast-and-sample-efficient-actor23.0
dueling-network-architectures-for-deep-0.8
deep-reinforcement-learning-with-double-q-6.0
deep-exploration-via-bootstrapped-dqn3
evolving-simple-programs-for-playing-atari2
online-and-offline-reinforcement-learning-by23.91
policy-optimization-with-penalized-point-7.89
the-arcade-learning-environment-an-evaluation-13.1
gdi-rethinking-what-makes-reinforcement24
asynchronous-methods-for-deep-reinforcement-0.1
dna-proximal-policy-optimization-with-a-dual-1.3
train-a-real-world-local-path-planner-in-one0.1
increasing-the-action-gap-new-operators-for-2.51
asynchronous-methods-for-deep-reinforcement0.1
prioritized-experience-replay16.0
increasing-the-action-gap-new-operators-for-0.15
generalized-data-distribution-iteration24
agent57-outperforming-the-atari-human23.93
recurrent-experience-replay-in-distributed23.7
noisy-networks-for-exploration1
a-distributional-perspective-on-reinforcement2.5
evolution-strategies-as-a-scalable0.2