HyperAI

Atari Games On Atari 2600 Asterix

المقاييس

Score

النتائج

نتائج أداء النماذج المختلفة على هذا المعيار القياسي

جدول المقارنة
اسم النموذجScore
distributed-prioritized-experience-replay313305
النموذج 21332
evolution-strategies-as-a-scalable1440
prioritized-experience-replay22484.5
dueling-network-architectures-for-deep364200.0
evolving-simple-programs-for-playing-atari1880
deep-reinforcement-learning-with-double-q16837.0
noisy-networks-for-exploration28350
soft-actor-critic-for-discrete-action272
dna-proximal-policy-optimization-with-a-dual323965
recurrent-experience-replay-in-distributed999153.3
train-a-real-world-local-path-planner-in-one567640
the-arcade-learning-environment-an-evaluation987.3
learning-values-across-many-orders-of18919.5
impala-scalable-distributed-deep-rl-with300732.00
human-level-control-through-deep6012
asynchronous-methods-for-deep-reinforcement22140.5
generalized-data-distribution-iteration999999
dueling-network-architectures-for-deep17356.5
dueling-network-architectures-for-deep375080.0
online-and-offline-reinforcement-learning-by862406.65
curl-contrastive-unsupervised-representations524.3
distributional-reinforcement-learning-with-1261025
prioritized-experience-replay31527
recurrent-rational-networks12621
fully-parameterized-quantile-function-for578388.5
massively-parallel-methods-for-deep3324.7
generalized-data-distribution-iteration759910
dueling-network-architectures-for-deep15840.0
deep-reinforcement-learning-with-double-q3170.5
النموذج 3121040
increasing-the-action-gap-new-operators-for19564.9
implicit-quantile-networks-for-distributional342016
recurrent-rational-networks18109
self-imitation-learning17984.2
a-distributional-perspective-on-reinforcement406211
deep-exploration-via-bootstrapped-dqn19713.2
mastering-atari-with-discrete-world-models-172311
asynchronous-methods-for-deep-reinforcement17244.5
the-reactor-a-fast-and-sample-efficient-actor205914.0
deep-reinforcement-learning-with-double-q364200.0
deep-reinforcement-learning-with-double-q4359.0
the-arcade-learning-environment-an-evaluation290700
mastering-atari-go-chess-and-shogi-by998425.00
asynchronous-methods-for-deep-reinforcement6723
dueling-network-architectures-for-deep28188.0
agent57-outperforming-the-atari-human991384.42
policy-optimization-with-penalized-point4310.67
increasing-the-action-gap-new-operators-for12852.08