HyperAI초신경

Atari Games On Atari 2600 Demon Attack

평가 지표

Score

평가 결과

이 벤치마크에서 각 모델의 성능 결과

비교 표
모델 이름Score
online-and-offline-reinforcement-learning-by143838.04
massively-parallel-methods-for-deep14880.1
asynchronous-methods-for-deep-reinforcement84997.5
evolution-strategies-as-a-scalable1166.5
agent57-outperforming-the-atari-human143161.44
dueling-network-architectures-for-deep72878.6
the-reactor-a-fast-and-sample-efficient-actor115154.0
prioritized-experience-replay71846.4
generalized-data-distribution-iteration787985
the-arcade-learning-environment-an-evaluation28158.8
asynchronous-methods-for-deep-reinforcement115201.9
dna-proximal-policy-optimization-with-a-dual97909
모델 130.0
deep-exploration-via-bootstrapped-dqn82610
dueling-network-architectures-for-deep60813.3
deep-reinforcement-learning-with-double-q12550.7
increasing-the-action-gap-new-operators-for70908.17
the-arcade-learning-environment-an-evaluation520.5
self-imitation-learning10140.5
mastering-atari-go-chess-and-shogi-by143964.26
playing-atari-with-six-neurons325
recurrent-experience-replay-in-distributed140002.3
increasing-the-action-gap-new-operators-for27153.48
learning-values-across-many-orders-of63644.9
deep-reinforcement-learning-with-double-q12149.4
mastering-atari-with-discrete-world-models-182263
generalized-data-distribution-iteration675530
impala-scalable-distributed-deep-rl-with132826.98
policy-optimization-with-penalized-point61147.33
implicit-quantile-networks-for-distributional128580
distributed-prioritized-experience-replay133086.4
human-level-control-through-deep9711.0
prioritized-experience-replay61277.5
gdi-rethinking-what-makes-reinforcement675530
dueling-network-architectures-for-deep56322.8
dueling-network-architectures-for-deep58044.2
evolving-simple-programs-for-playing-atari2387
distributional-reinforcement-learning-with-1121551
noisy-networks-for-exploration69311
deep-reinforcement-learning-with-double-q69803.4
a-distributional-perspective-on-reinforcement130955.0
모델 42230324
train-a-real-world-local-path-planner-in-one119773.9
deep-reinforcement-learning-with-double-q73371.3
curl-contrastive-unsupervised-representations834
asynchronous-methods-for-deep-reinforcement113308.4