HyperAI

Atari Games On Atari 2600 Seaquest

المقاييس

Score

النتائج

نتائج أداء النماذج المختلفة على هذا المعيار القياسي

جدول المقارنة
اسم النموذجScore
playing-atari-with-deep-reinforcement1740
soft-actor-critic-for-discrete-action211.6
mastering-atari-go-chess-and-shogi-by999976.52
deep-reinforcement-learning-with-double-q4216.7
asynchronous-methods-for-deep-reinforcement2300.2
a-distributional-perspective-on-reinforcement266434.0
prioritized-experience-replay26357.8
generalized-data-distribution-iteration943910
prioritized-experience-replay25463.7
self-imitation-learning2456.5
dna-proximal-policy-optimization-with-a-dual4146
deep-reinforcement-learning-with-double-q14498.0
evolution-strategies-as-a-scalable1390.0
recurrent-rational-networks7460
discrete-latent-space-world-models-for635
gdi-rethinking-what-makes-reinforcement943910
dueling-network-architectures-for-deep37361.6
deep-exploration-via-bootstrapped-dqn9083.1
human-level-control-through-deep5286.0
mean-actor-critic1703.4
the-arcade-learning-environment-an-evaluation5132.4
agent57-outperforming-the-atari-human999997.63
curl-contrastive-unsupervised-representations408
recurrent-rational-networks6603
noisy-networks-for-exploration16754
playing-atari-with-six-neurons320
distributed-deep-reinforcement-learning-learn1832
increasing-the-action-gap-new-operators-for8670.5
dueling-network-architectures-for-deep16452.7
deep-attention-recurrent-q-network7263
the-arcade-learning-environment-an-evaluation664.8
dueling-network-architectures-for-deep931.6
iq-learn-inverse-soft-q-learning-for-
increasing-the-action-gap-new-operators-for13230.74
implicit-quantile-networks-for-distributional30140
value-prediction-network5628
النموذج 37675.5
generalized-data-distribution-iteration1000000
deep-reinforcement-learning-with-double-q5860.6
mastering-atari-with-discrete-world-models-17480
train-a-real-world-local-path-planner-in-one29278.6
dueling-network-architectures-for-deep50254.2
evolving-simple-programs-for-playing-atari724
decision-transformer-reinforcement-learning2.4
generalized-data-distribution-iteration1000000
asynchronous-methods-for-deep-reinforcement1326.1
improving-computational-efficiency-in-visual561.2
impala-scalable-distributed-deep-rl-with1753.20
distributional-reinforcement-learning-with-18268
learning-values-across-many-orders-of10932.3
deep-reinforcement-learning-with-double-q1431.2
asynchronous-methods-for-deep-reinforcement2355.4
policy-optimization-with-penalized-point1807.47
massively-parallel-methods-for-deep10145.9
online-and-offline-reinforcement-learning-by999659.18
recurrent-experience-replay-in-distributed999996.7
distributed-prioritized-experience-replay392952.3