Atari Games On Atari 2600 Chopper Command
Metrics
Score
Results
Performance results of various models on this benchmark
| Model Name | Score | Paper Title | Repository |
| --- | --- | --- | --- |
| A2C + SIL | 6710 | Self-Imitation Learning | - |
| Advantage Learning | 5431.36 | Increasing the Action Gap: New Operators for Reinforcement Learning | - |
| GDI-H3 | 999999 | GDI: Rethinking What Makes Reinforcement Learning Different From Supervised Learning | - |
| CGP | 3580 | Evolving simple programs for playing Atari games | - |
| Prior hs | 4635.0 | Prioritized Experience Replay | - |
| NoisyNet-Dueling | 11477 | Noisy Networks for Exploration | - |
| R2D2 | 986652.0 | Recurrent Experience Replay in Distributed Reinforcement Learning | - |
| POP3D | 6308.33 | Policy Optimization With Penalized Point Probability Distance: An Alternative To Proximal Policy Optimization | - |
| Ape-X | 721851 | Distributed Prioritized Experience Replay | - |
| DQN noop | 6126.0 | Deep Reinforcement Learning with Double Q-learning | - |
| Prior+Duel hs | 8058.0 | Deep Reinforcement Learning with Double Q-learning | - |
| DQN hs | 5017.0 | Deep Reinforcement Learning with Double Q-learning | - |
| MuZero | 991039.70 | Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model | - |
| IQN | 16836 | Implicit Quantile Networks for Distributional Reinforcement Learning | - |
| Gorila | 3191.8 | Massively Parallel Methods for Deep Reinforcement Learning | - |
| Reactor 500M | 107779.0 | The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning | - |
| DNA | 31181 | DNA: Proximal Policy Optimization with a Dual Network Architecture | - |
| FQF | 876460.0 | Fully Parameterized Quantile Function for Distributional Reinforcement Learning | - |
| A3C LSTM hs | 10150.0 | Asynchronous Methods for Deep Reinforcement Learning | - |
| C51 noop | 15600.0 | A Distributional Perspective on Reinforcement Learning | - |