HyperAI

Atari Games On Atari 2600 Kung Fu Master

Metrics

Score

Results

Performance results of various models on this benchmark

Comparison table

| Model name | Score |
| --- | --- |
| prioritized-experience-replay | 39581.0 |
| deep-reinforcement-learning-with-double-q | 20882.0 |
| dueling-network-architectures-for-deep | 29710.0 |
| asynchronous-methods-for-deep-reinforcement | 40835.0 |
| increasing-the-action-gap-new-operators-for | 32182.99 |
| dueling-network-architectures-for-deep | 48375.0 |
| asynchronous-methods-for-deep-reinforcement | 3046.0 |
| generalized-data-distribution-iteration | 140440 |
| fully-parameterized-quantile-function-for | 111138.5 |
| deep-reinforcement-learning-with-double-q | 30207.0 |
| deep-exploration-via-bootstrapped-dqn | 36733.3 |
| learning-values-across-many-orders-of | 34393.0 |
| evolving-simple-programs-for-playing-atari | 57400 |
| policy-optimization-with-penalized-point | 33728 |
| increasing-the-action-gap-new-operators-for | 34650.91 |
| gdi-rethinking-what-makes-reinforcement-1 | 1666000 |
| mastering-atari-with-discrete-world-models-1 | 62741 |
| noisy-networks-for-exploration | 41672 |
| massively-parallel-methods-for-deep | 20620.0 |
| deep-reinforcement-learning-with-double-q | 26059.0 |
| the-arcade-learning-environment-an-evaluation | 48854.5 |
| the-arcade-learning-environment-an-evaluation | 19544 |
| dueling-network-architectures-for-deep | 24288.0 |
| prioritized-experience-replay | 31676.0 |
| train-a-real-world-local-path-planner-in-one | 85182 |
| dna-proximal-policy-optimization-with-a-dual | 110962 |
| asynchronous-methods-for-deep-reinforcement | 28819.0 |
| a-distributional-perspective-on-reinforcement | 48192.0 |
| curl-contrastive-unsupervised-representations | 14280 |
| distributional-reinforcement-learning-with-1 | 76642 |
| online-and-offline-reinforcement-learning-by | 116726.96 |
| impala-scalable-distributed-deep-rl-with | 43375.50 |
| generalized-data-distribution-iteration | 1666665 |
| agent57-outperforming-the-atari-human | 206845.82 |
| implicit-quantile-networks-for-distributional | 73512 |
| self-imitation-learning | 34449.2 |
| mastering-atari-go-chess-and-shogi-by | 204824.00 |
| deep-reinforcement-learning-with-double-q | 37484.0 |
| dueling-network-architectures-for-deep | 34294.0 |
| recurrent-experience-replay-in-distributed | 233413.3 |
| distributed-prioritized-experience-replay | 97829.5 |
| human-level-control-through-deep | 23270.0 |
| Model 43 | 29151.0 |