HyperAI

Atari Games On Atari 2600 Frostbite

Metrics

Score

Results

Performance results of various models on this benchmark

Comparison table
Model name | Score
learning-values-across-many-orders-of | 3469.6
asynchronous-methods-for-deep-reinforcement | 197.6
increasing-the-action-gap-new-operators-for | 3248.96
dna-proximal-policy-optimization-with-a-dual | 320
dueling-network-architectures-for-deep | 2332.4
deep-reinforcement-learning-with-double-q | 797.4
dueling-network-architectures-for-deep | 7413.0
incentivizing-exploration-in-reinforcement | 507.0
recurrent-experience-replay-in-distributed | 315456.4
evolution-strategies-as-a-scalable | 370.0
the-arcade-learning-environment-an-evaluation | 216.9
deep-reinforcement-learning-with-double-q | 4038.4
exploration-a-study-of-count-based | 5214.0
mastering-atari-go-chess-and-shogi-by | 631378.53
prioritized-experience-replay | 3510.0
deep-exploration-via-bootstrapped-dqn | 2181.4
curl-contrastive-unsupervised-representations | 924
prioritized-experience-replay | 4380.1
asynchronous-methods-for-deep-reinforcement | 190.5
value-prediction-network | 3811
Model 21 | 180.9
a-distributional-perspective-on-reinforcement | 3965.0
the-arcade-learning-environment-an-evaluation | 270.5
mastering-atari-with-discrete-world-models-1 | 11384
online-and-offline-reinforcement-learning-by | 374769.76
human-level-control-through-deep | 328.3
dueling-network-architectures-for-deep | 4672.8
increasing-the-action-gap-new-operators-for | 2305.82
implicit-quantile-networks-for-distributional | 4324
deep-reinforcement-learning-with-double-q | 496.1
fully-parameterized-quantile-function-for | 214060
massively-parallel-methods-for-deep | 426.6
self-imitation-learning | 6289.8
playing-atari-with-six-neurons | 300
deep-reinforcement-learning-with-double-q | 1448.1
model-free-episodic-control-with-state | 2394
dueling-network-architectures-for-deep | 1683.3
asynchronous-methods-for-deep-reinforcement | 180.1
count-based-exploration-in-feature-space-for | 2770.1
distributed-prioritized-experience-replay | 9328.6
distributional-reinforcement-learning-with-1 | 4384
generalized-data-distribution-iteration | 11330
noisy-networks-for-exploration | 2923
gdi-rethinking-what-makes-reinforcement | 10485
evolving-simple-programs-for-playing-atari | 782
train-a-real-world-local-path-planner-in-one | 8616.4
policy-optimization-with-penalized-point | 316.87
impala-scalable-distributed-deep-rl-with | 317.75
soft-actor-critic-for-discrete-action | 59.4
generalized-data-distribution-iteration | 11330
count-based-exploration-in-feature-space-for | 1394.3
generalized-data-distribution-iteration | 10485
agent57-outperforming-the-atari-human | 541280.88