HyperAI

Atari Games On Atari 2600 Private Eye

Metriken

Score

Ergebnisse

Leistungsergebnisse verschiedener Modelle zu diesem Benchmark

Vergleichstabelle
ModellnameScore
gdi-rethinking-what-makes-reinforcement15100
count-based-exploration-with-the-successor99.1
deep-reinforcement-learning-with-double-q-575.5
first-return-then-explore95756
self-imitation-learning661.2
deep-reinforcement-learning-with-double-q146.7
mastering-atari-with-discrete-world-models-12198
asynchronous-methods-for-deep-reinforcement206.9
a-distributional-perspective-on-reinforcement15095.0
recurrent-experience-replay-in-distributed5322.7
prioritized-experience-replay670.7
generalized-data-distribution-iteration15100
generalized-data-distribution-iteration15100
distributional-reinforcement-learning-with-1350
Modell 1586.0
count-based-exploration-with-neural-density8358.7
dueling-network-architectures-for-deep292.6
dueling-network-architectures-for-deep206.0
dueling-network-architectures-for-deep103.0
prioritized-experience-replay200.0
learning-values-across-many-orders-of286.7
the-arcade-learning-environment-an-evaluation684.3
deep-reinforcement-learning-with-double-q1277.6
unifying-count-based-exploration-and99.32
exploration-by-self-supervised-exploitation15089
impala-scalable-distributed-deep-rl-with98.50
evolution-strategies-as-a-scalable100.0
asynchronous-methods-for-deep-reinforcement421.1
distributed-prioritized-experience-replay49.8
policy-optimization-with-penalized-point79.67
increasing-the-action-gap-new-operators-for5276.16
implicit-quantile-networks-for-distributional200
evolving-simple-programs-for-playing-atari12702.2
human-level-control-through-deep1788.0
dna-proximal-policy-optimization-with-a-dual100
deep-reinforcement-learning-with-double-q207.9
count-based-exploration-with-neural-density206.0
the-arcade-learning-environment-an-evaluation1947.3
dueling-network-architectures-for-deep129.7
exploration-by-self-supervised-exploitation17313
curl-contrastive-unsupervised-representations105.2
mastering-atari-go-chess-and-shogi-by15299.98
exploration-by-self-supervised-exploitation4213
online-and-offline-reinforcement-learning-by100
large-scale-study-of-curiosity-driven3036.5
massively-parallel-methods-for-deep2598.6
train-a-real-world-local-path-planner-in-one349.7
agent57-outperforming-the-atari-human79716.46
noisy-networks-for-exploration279
exploration-by-random-network-distillation8666
asynchronous-methods-for-deep-reinforcement194.4
deep-exploration-via-bootstrapped-dqn1812.5