HyperAI超神経

Atari Games On Atari 2600 Road Runner

評価指標

Score

評価結果

このベンチマークにおける各モデルのパフォーマンス結果

比較表
モデル名Score
curl-contrastive-unsupervised-representations6786.7
mastering-atari-go-chess-and-shogi-by613411.80
recurrent-experience-replay-in-distributed599246.7
distributional-reinforcement-learning-with-164262
a-distributional-perspective-on-reinforcement55839.0
impala-scalable-distributed-deep-rl-with57121.00
dueling-network-architectures-for-deep58549.0
self-imitation-learning57071.7
agent57-outperforming-the-atari-human243025.8
implicit-quantile-networks-for-distributional57900
deep-reinforcement-learning-with-double-q35215.0
generalized-data-distribution-iteration999999
the-arcade-learning-environment-an-evaluation67.7
deep-reinforcement-learning-with-double-q54630.0
deep-reinforcement-learning-with-double-q43156.0
generalized-data-distribution-iteration878600
dueling-network-architectures-for-deep69524.0
dna-proximal-policy-optimization-with-a-dual61713
soft-actor-critic-for-discrete-action305.3
train-a-real-world-local-path-planner-in-one56520
the-arcade-learning-environment-an-evaluation38725
deep-reinforcement-learning-with-double-q39544.0
mastering-atari-with-discrete-world-models-1203576
dueling-network-architectures-for-deep62151.0
モデル 2589.1
increasing-the-action-gap-new-operators-for52351.23
evolving-simple-programs-for-playing-atari8960
prioritized-experience-replay52264.0
learning-values-across-many-orders-of47770.0
dueling-network-architectures-for-deep44127.0
asynchronous-methods-for-deep-reinforcement73949.0
distributed-prioritized-experience-replay222234.5
evolution-strategies-as-a-scalable16590.0
asynchronous-methods-for-deep-reinforcement34216.0
deep-exploration-via-bootstrapped-dqn51500
human-level-control-through-deep18257.0
noisy-networks-for-exploration234352
improving-computational-efficiency-in-visual11794
asynchronous-methods-for-deep-reinforcement31769.0
prioritized-experience-replay57608.0
massively-parallel-methods-for-deep43079.8
gdi-rethinking-what-makes-reinforcement878600
online-and-offline-reinforcement-learning-by531097
policy-optimization-with-penalized-point44679.67