HyperAI

Atari Games On Atari 2600 Beam Rider

Metrics

Score

Results

Performance results of various models on this benchmark

Comparison table
| Model name | Score |
| --- | --- |
| asynchronous-methods-for-deep-reinforcement | 24622.2 |
| prioritized-experience-replay | 23384.2 |
| impala-scalable-distributed-deep-rl-with | 32463.47 |
| deep-reinforcement-learning-with-double-q | 9743.2 |
| learning-values-across-many-orders-of | 8299.4 |
| dueling-network-architectures-for-deep | 12164.0 |
| human-level-control-through-deep | 6846.0 |
| policy-optimization-with-penalized-point | 4549 |
| the-arcade-learning-environment-an-evaluation | 929.4 |
| playing-atari-with-deep-reinforcement | 5184 |
| Model 1 | 11743.0 |
| mastering-atari-with-discrete-world-models-1 | 18646 |
| massively-parallel-methods-for-deep | 3822.1 |
| recurrent-independent-mechanisms | 5320 |
| dueling-network-architectures-for-deep | 30276.5 |
| asynchronous-methods-for-deep-reinforcement | 13235.9 |
| evolving-simple-programs-for-playing-atari | 1341.6 |
| the-arcade-learning-environment-an-evaluation | 6624.6 |
| iq-learn-inverse-soft-q-learning-for | - |
| online-and-offline-reinforcement-learning-by | 333077.44 |
| agent57-outperforming-the-atari-human | 300509.8 |
| soft-actor-critic-for-discrete-action | 432.1 |
| deep-reinforcement-learning-with-double-q | 37412.2 |
| a-distributional-perspective-on-reinforcement | 14074.0 |
| noisy-networks-for-exploration | 23134 |
| asynchronous-methods-for-deep-reinforcement | 22707.9 |
| recurrent-experience-replay-in-distributed | 188257.4 |
| deep-reinforcement-learning-with-double-q | 17417.2 |
| dueling-network-architectures-for-deep | 13772.8 |
| distributional-reinforcement-learning-with-1 | 34821 |
| mastering-atari-go-chess-and-shogi-by | 454993.53 |
| deep-reinforcement-learning-with-double-q | 8627.5 |
| evolution-strategies-as-a-scalable | 744.0 |
| dueling-network-architectures-for-deep | 14591.3 |
| distributed-deep-reinforcement-learning-learn | 14900 |
| generalized-data-distribution-iteration | 162100 |
| mean-actor-critic | 6072 |
| deep-exploration-via-bootstrapped-dqn | 23429.8 |
| increasing-the-action-gap-new-operators-for | 13145.34 |
| implicit-quantile-networks-for-distributional | 42776 |
| generalized-data-distribution-iteration | 422890 |
| train-a-real-world-local-path-planner-in-one | 26841.6 |
| the-reactor-a-fast-and-sample-efficient-actor | 11033.4 |
| increasing-the-action-gap-new-operators-for | 10054.58 |
| dna-proximal-policy-optimization-with-a-dual | 20393 |
| gdi-rethinking-what-makes-reinforcement | 162100 |
| self-imitation-learning | 2366.2 |
| distributed-prioritized-experience-replay | 63305.2 |
| prioritized-experience-replay | 31181.3 |