HyperAI초신경

Atari Games On Atari 2600 Ms Pacman

평가 지표

Score

평가 결과

이 벤치마크에서 각 모델의 성능 결과

비교 표
모델 이름Score
asynchronous-methods-for-deep-reinforcement653.7
evolving-simple-programs-for-playing-atari2568
모델 31227.0
human-level-control-through-deep2311.0
gdi-rethinking-what-makes-reinforcement11536
the-arcade-learning-environment-an-evaluation1691.8
curl-contrastive-unsupervised-representations1492.8
noisy-networks-for-exploration5546
model-free-episodic-control-with-state8530.4004
fully-parameterized-quantile-function-for7631.9
policy-optimization-with-penalized-point1683.87
online-and-offline-reinforcement-learning-by70659.76
distributed-prioritized-experience-replay11255.2
deep-reinforcement-learning-with-double-q3085.6
self-imitation-learning4025.1
the-arcade-learning-environment-an-evaluation22336
dueling-network-architectures-for-deep3327.3
deep-reinforcement-learning-with-double-q1241.3
distributional-reinforcement-learning-with-15821
dueling-network-architectures-for-deep2711.4
mastering-atari-with-discrete-world-models-15652
dueling-network-architectures-for-deep6283.5
implicit-quantile-networks-for-distributional6349
dna-proximal-policy-optimization-with-a-dual5894
asynchronous-methods-for-deep-reinforcement850.7
deep-reinforcement-learning-with-double-q1007.8
recurrent-experience-replay-in-distributed42281.7
mastering-atari-go-chess-and-shogi-by243401.10
a-distributional-perspective-on-reinforcement3415.0
train-a-real-world-local-path-planner-in-one4416
impala-scalable-distributed-deep-rl-with7342.32
increasing-the-action-gap-new-operators-for3917.55
value-prediction-network2689
prioritized-experience-replay6518.7
generalized-data-distribution-iteration11573
dueling-network-architectures-for-deep2250.6
generalized-data-distribution-iteration11536
agent57-outperforming-the-atari-human63994.44
prioritized-experience-replay1865.9
soft-actor-critic-for-discrete-action690.9
deep-reinforcement-learning-with-double-q1092.3
asynchronous-methods-for-deep-reinforcement594.4
deep-exploration-via-bootstrapped-dqn2983.3
increasing-the-action-gap-new-operators-for4065.8
massively-parallel-methods-for-deep1263.0
learning-values-across-many-orders-of4963.8
rainbow-combining-improvements-in-deep2570.2