HyperAI

Atari Games On Atari 2600 Boxing

المقاييس

Score

النتائج

نتائج أداء النماذج المختلفة على هذا المعيار القياسي

جدول المقارنة
اسم النموذجScore
asynchronous-methods-for-deep-reinforcement37.3
النموذج 29.8
online-and-offline-reinforcement-learning-by100
gdi-rethinking-what-makes-reinforcement100
the-reactor-a-fast-and-sample-efficient-actor99.4
asynchronous-methods-for-deep-reinforcement59.8
the-arcade-learning-environment-an-evaluation100
train-a-real-world-local-path-planner-in-one99.6
self-imitation-learning99.6
agent57-outperforming-the-atari-human100
dueling-network-architectures-for-deep91.6
generalized-data-distribution-iteration100
curl-contrastive-unsupervised-representations4.8
dna-proximal-policy-optimization-with-a-dual99.9
the-arcade-learning-environment-an-evaluation44
distributed-prioritized-experience-replay100
mastering-atari-with-discrete-world-models-192
dueling-network-architectures-for-deep99.4
deep-reinforcement-learning-with-double-q70.3
implicit-quantile-networks-for-distributional99.8
prioritized-experience-replay95.6
increasing-the-action-gap-new-operators-for94.3
prioritized-experience-replay72.3
massively-parallel-methods-for-deep74.2
deep-reinforcement-learning-with-double-q73.5
dueling-network-architectures-for-deep77.3
recurrent-experience-replay-in-distributed98.5
deep-exploration-via-bootstrapped-dqn93.2
distributed-deep-reinforcement-learning-learn98
human-level-control-through-deep71.8
learning-values-across-many-orders-of99.3
distributional-reinforcement-learning-with-199.9
a-distributional-perspective-on-reinforcement97.8
policy-optimization-with-penalized-point97.23
deep-reinforcement-learning-with-double-q79.2
dueling-network-architectures-for-deep98.9
noisy-networks-for-exploration100
mastering-atari-go-chess-and-shogi-by100.00
deep-reinforcement-learning-with-double-q88.0
increasing-the-action-gap-new-operators-for93.94
evolving-simple-programs-for-playing-atari38.4
generalized-data-distribution-iteration100
impala-scalable-distributed-deep-rl-with99.96
evolution-strategies-as-a-scalable49.8
asynchronous-methods-for-deep-reinforcement33.7