HyperAI超神経

Atari Games On Atari 2600 Centipede

Evaluation Metric

Score

Evaluation Results

Performance results of each model on this benchmark

Comparison Table
Model name | Score
gdi-rethinking-what-makes-reinforcement | 155830
self-imitation-learning | 7559.5
deep-exploration-via-bootstrapped-dqn | 4553.5
deep-reinforcement-learning-with-double-q | 4657.7
deep-reinforcement-learning-with-double-q | 5570.2
generalized-data-distribution-iteration | 155830
online-and-offline-reinforcement-learning-by | 874301.64
evolution-strategies-as-a-scalable | 7783.9
dueling-network-architectures-for-deep | 4881.0
asynchronous-methods-for-deep-reinforcement | 3306.5
first-return-then-explore | 1422628
asynchronous-methods-for-deep-reinforcement | 1997.0
a-distributional-perspective-on-reinforcement | 9646.0
recurrent-experience-replay-in-distributed | 599140.3
deep-reinforcement-learning-with-double-q | 3973.9
increasing-the-action-gap-new-operators-for | 4539.55
deep-reinforcement-learning-with-double-q | 3853.5
massively-parallel-methods-for-deep | 6296.9
Model 19 | 4647.0
generalized-data-distribution-iteration | 195630
dueling-network-architectures-for-deep | 5409.4
mastering-atari-go-chess-and-shogi-by | 1159049.27
impala-scalable-distributed-deep-rl-with | 11049.75
evolving-simple-programs-for-playing-atari | 24708
gdi-rethinking-what-makes-reinforcement-1 | 1359533
noisy-networks-for-exploration | 7596
asynchronous-methods-for-deep-reinforcement | 3755.8
dueling-network-architectures-for-deep | 7561.4
distributed-prioritized-experience-replay | 12974
implicit-quantile-networks-for-distributional | 11561
train-a-real-world-local-path-planner-in-one | 3899.8
agent57-outperforming-the-atari-human | 412847.86
policy-optimization-with-penalized-point | 3315.44
learning-values-across-many-orders-of | 49065.8
mastering-atari-with-discrete-world-models-1 | 11883
the-arcade-learning-environment-an-evaluation | 125123
distributional-reinforcement-learning-with-1 | 12447
the-reactor-a-fast-and-sample-efficient-actor | 3422.0
dueling-network-architectures-for-deep | 7687.5
prioritized-experience-replay | 4463.2
the-arcade-learning-environment-an-evaluation | 8803.8
dna-proximal-policy-optimization-with-a-dual | 100194
increasing-the-action-gap-new-operators-for | 4225.18
prioritized-experience-replay | 3489.1
human-level-control-through-deep | 8309.0