HyperAI초신경
홈
뉴스
최신 연구 논문
튜토리얼
데이터셋
백과사전
SOTA
LLM 모델
GPU 랭킹
컨퍼런스
전체 검색
소개
한국어
HyperAI초신경
Toggle sidebar
전체 사이트 검색...
⌘
K
홈
SOTA
Atari Games
Atari Games On Atari 2600 Gravitar
Atari Games On Atari 2600 Gravitar
평가 지표
Score
평가 결과
이 벤치마크에서 각 모델의 성능 결과
Columns
모델 이름
Score
Paper Title
Repository
SARSA
429.0
-
-
CGP
2350
Evolving simple programs for playing Atari games
Nature DQN
306.7
Human level control through deep reinforcement learning
ES FF (1 hour) noop
805.0
Evolution Strategies as a Scalable Alternative to Reinforcement Learning
Agent57
19213.96
Agent57: Outperforming the Atari Human Benchmark
SND-STD
4643
Self-supervised network distillation: an effective approach to exploration in sparse reward environments
MuZero
6682.70
Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model
SND-V
2741
Self-supervised network distillation: an effective approach to exploration in sparse reward environments
SND-VIC
6712
Self-supervised network distillation: an effective approach to exploration in sparse reward environments
A2C + SIL
1874.2
Self-Imitation Learning
A3C LSTM hs
320.0
Asynchronous Methods for Deep Reinforcement Learning
GDI-I3
5905
GDI: Rethinking What Makes Reinforcement Learning Different From Supervised Learning
-
DQNMMCe
1078.3
Count-Based Exploration with the Successor Representation
DDQN+Pop-Art noop
483.5
Learning values across many orders of magnitude
-
Duel noop
588.0
Dueling Network Architectures for Deep Reinforcement Learning
MuZero (Res2 Adam)
8006.93
Online and Offline Reinforcement Learning by Planning with a Learned Model
DQN hs
298.0
Deep Reinforcement Learning with Double Q-learning
IMPALA (deep)
359.50
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
C51 noop
440.0
A Distributional Perspective on Reinforcement Learning
GDI-H3
5915
Generalized Data Distribution Iteration
-
0 of 53 row(s) selected.
Previous
Next