HyperAI초신경
Atari Games On Atari 2600 Centipede
Evaluation Metric
Score
Evaluation Results
Performance results of each model on this benchmark
| Model Name | Score | Paper Title | Repository |
| --- | --- | --- | --- |
| GDI-I3 | 155830 | GDI: Rethinking What Makes Reinforcement Learning Different From Supervised Learning | - |
| A2C + SIL | 7559.5 | Self-Imitation Learning | |
| Bootstrapped DQN | 4553.5 | Deep Exploration via Bootstrapped DQN | |
| DQN noop | 4657.7 | Deep Reinforcement Learning with Double Q-learning | |
| Prior+Duel hs | 5570.2 | Deep Reinforcement Learning with Double Q-learning | |
| GDI-I3 | 155830 | Generalized Data Distribution Iteration | - |
| MuZero (Res2 Adam) | 874301.64 | Online and Offline Reinforcement Learning by Planning with a Learned Model | |
| ES FF (1 hour) noop | 7783.9 | Evolution Strategies as a Scalable Alternative to Reinforcement Learning | |
| Duel hs | 4881.0 | Dueling Network Architectures for Deep Reinforcement Learning | |
| A3C FF (1 day) hs | 3306.5 | Asynchronous Methods for Deep Reinforcement Learning | |
| Go-Explore | 1422628 | First return, then explore | |
| A3C LSTM hs | 1997.0 | Asynchronous Methods for Deep Reinforcement Learning | |
| C51 noop | 9646.0 | A Distributional Perspective on Reinforcement Learning | |
| R2D2 | 599140.3 | Recurrent Experience Replay in Distributed Reinforcement Learning | - |
| DQN hs | 3973.9 | Deep Reinforcement Learning with Double Q-learning | |
| Persistent AL | 4539.55 | Increasing the Action Gap: New Operators for Reinforcement Learning | |
| DDQN (tuned) hs | 3853.5 | Deep Reinforcement Learning with Double Q-learning | |
| Gorila | 6296.9 | Massively Parallel Methods for Deep Reinforcement Learning | |
| SARSA | 4647.0 | - | - |
| GDI-H3 | 195630 | Generalized Data Distribution Iteration | - |