HyperAI超神経

Atari Games on Atari 2600 Montezuma's Revenge

Evaluation Metrics

Score

Evaluation Results

Performance results of each model on this benchmark

Comparison Table
| Model | Score |
| --- | --- |
| recurrent-experience-replay-in-distributed | 2061.3 |
| generalized-data-distribution-iteration | 2500 |
| the-arcade-learning-environment-an-evaluation | 10.7 |
| incentivizing-exploration-in-reinforcement | 142 |
| count-based-exploration-in-feature-space-for | 2745.4 |
| dna-proximal-policy-optimization-with-a-dual | 0 |
| count-based-exploration-with-the-successor | 1778.8 |
| asynchronous-methods-for-deep-reinforcement | 53 |
| impala-scalable-distributed-deep-rl-with | 0.00 |
| deep-reinforcement-learning-with-double-q | 24.0 |
| asynchronous-methods-for-deep-reinforcement | 67 |
| gdi-rethinking-what-makes-reinforcement | 3000 |
| contingency-aware-exploration-in | 6635 |
| increasing-the-action-gap-new-operators-for | 0.42 |
| policy-optimization-with-penalized-point | 0 |
| human-level-control-through-deep | 0 |
| unifying-count-based-exploration-and | 3459 |
| train-a-real-world-local-path-planner-in-one | 0 |
| massively-parallel-methods-for-deep | 84 |
| count-based-exploration-in-feature-space-for | 399.5 |
| exploration-by-self-supervised-exploitation | 7838 |
| count-based-exploration-with-neural-density | 3705.5 |
| prioritized-experience-replay | 51 |
| distributed-prioritized-experience-replay | 2500.0 |
| dueling-network-architectures-for-deep | 22.0 |
| agent57-outperforming-the-atari-human | 9352.01 |
| exploration-by-random-network-distillation | 8152 |
| online-and-offline-reinforcement-learning-by | 2500 |
| evolving-simple-programs-for-playing-atari | 0 |
| asynchronous-methods-for-deep-reinforcement | 41 |
| exploration-by-self-supervised-exploitation | 7212 |
| exploration-a-study-of-count-based | 75 |
| exploration-by-self-supervised-exploitation | 21565 |
| count-based-exploration-with-the-successor | 1778.6 |
| self-imitation-learning | 1100 |
| large-scale-study-of-curiosity-driven | 2504.6 |
| first-return-then-explore | 43791 |
| unifying-count-based-exploration-and | 273.7 |
| deep-exploration-via-bootstrapped-dqn | 100 |
| Model 40 | 259 |
| go-explore-a-new-approach-for-hard | 43763 |
| generalized-data-distribution-iteration | 3000 |
| deep-reinforcement-learning-with-double-q | 47.0 |
| mastering-atari-with-discrete-world-models-1 | 81 |
| noisy-networks-for-exploration | 57 |
| mastering-atari-go-chess-and-shogi-by | 0.00 |
| implicit-quantile-networks-for-distributional | 0 |
| distributional-reinforcement-learning-with-1 | 0 |
| increasing-the-action-gap-new-operators-for | 1.72 |
| deep-reinforcement-learning-with-double-q | 42.0 |