HyperAI

Atari Games on Atari 2600 Bowling

Metrics

Score

Results

Performance results of various models on this benchmark

Comparison table
Model name                                      Score
increasing-the-action-gap-new-operators-for     57.41
gdi-rethinking-what-makes-reinforcement         201.9
deep-reinforcement-learning-with-double-q       69.6
dna-proximal-policy-optimization-with-a-dual    181
asynchronous-methods-for-deep-reinforcement     41.8
evolving-simple-programs-for-playing-atari      85.8
generalized-data-distribution-iteration         205.2
impala-scalable-distributed-deep-rl-with        59.92
dueling-network-architectures-for-deep          65.5
train-a-real-world-local-path-planner-in-one    62.4
deep-reinforcement-learning-with-double-q       50.4
distributed-prioritized-experience-replay       17.6
implicit-quantile-networks-for-distributional   86.5
dueling-network-architectures-for-deep          65.7
distributional-reinforcement-learning-with-1    77.2
asynchronous-methods-for-deep-reinforcement     35.1
dueling-network-architectures-for-deep          68.1
rudder-return-decomposition-for-delayed         179
increasing-the-action-gap-new-operators-for     71.59
massively-parallel-methods-for-deep             54
self-imitation-learning                         31.1
prioritized-experience-replay                   52
policy-optimization-with-penalized-point        38.99
mastering-atari-with-discrete-world-models-1    49
generalized-data-distribution-iteration         201.9
first-return-then-explore                       260
dueling-network-architectures-for-deep          46.7
fully-parameterized-quantile-function-for       102.3
deep-reinforcement-learning-with-double-q       50.4
deep-reinforcement-learning-with-double-q       56.5
a-distributional-perspective-on-reinforcement   81.8
online-and-offline-reinforcement-learning-by    131.65
learning-values-across-many-orders-of           102.1
the-reactor-a-fast-and-sample-efficient-actor   81.0
evolution-strategies-as-a-scalable              30
asynchronous-methods-for-deep-reinforcement     36.2
agent57-outperforming-the-atari-human           251.18
prioritized-experience-replay                   47.9
deep-exploration-via-bootstrapped-dqn           60.2
human-level-control-through-deep                42.4
mastering-atari-go-chess-and-shogi-by           260.13
Model 42                                        36.4
recurrent-experience-replay-in-distributed      219.5
the-arcade-learning-environment-an-evaluation   43.9
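
The same paper identifier appears on several rows of the table, presumably one row per model variant, and the rows are not sorted by score. As a minimal sketch of how a reader might rank the entries, the Python below keeps the highest score per identifier and sorts the result. The `rows` list holds only a few rows copied from the table, and the helper names `best_per_paper` and `ranked` are hypothetical, introduced here for illustration only.

```python
# Illustrative sketch: a small sample of (paper identifier, score) rows
# copied from the comparison table; extend the list with the remaining rows.
rows = [
    ("first-return-then-explore", 260.0),
    ("mastering-atari-go-chess-and-shogi-by", 260.13),
    ("agent57-outperforming-the-atari-human", 251.18),
    ("dueling-network-architectures-for-deep", 65.5),
    ("dueling-network-architectures-for-deep", 68.1),
    ("dueling-network-architectures-for-deep", 46.7),
    ("distributed-prioritized-experience-replay", 17.6),
]


def best_per_paper(rows):
    """Keep the highest score for each identifier (hypothetical helper)."""
    best = {}
    for slug, score in rows:
        if slug not in best or score > best[slug]:
            best[slug] = score
    return best


def ranked(rows):
    """Return (identifier, score) pairs sorted from highest to lowest score."""
    return sorted(best_per_paper(rows).items(),
                  key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    for slug, score in ranked(rows):
        print(f"{score:8.2f}  {slug}")
```

Taking the maximum per identifier is only one possible choice. Keeping every row is equally valid when the variants themselves are being compared.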