HyperAI
HyperAI초신경
홈
플랫폼
문서
뉴스
연구 논문
튜토리얼
데이터셋
백과사전
SOTA
LLM 모델
GPU 랭킹
컨퍼런스
전체 검색
소개
서비스 약관
개인정보 처리방침
한국어
HyperAI
HyperAI초신경
Toggle Sidebar
전체 사이트 검색...
⌘
K
Command Palette
Search for a command to run...
플랫폼
홈
SOTA
아타리 게임
Atari Games On Atari 2600 Centipede
Atari Games On Atari 2600 Centipede
평가 지표
Score
평가 결과
이 벤치마크에서 각 모델의 성능 결과
Columns
모델 이름
Score
Paper Title
Go-Explore
1422628
First return, then explore
GDI-H3(1B frames)
1359533
GDI: Rethinking What Makes Reinforcement Learning Different from Supervised Learning
MuZero
1159049.27
Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model
MuZero (Res2 Adam)
874301.64
Online and Offline Reinforcement Learning by Planning with a Learned Model
R2D2
599140.3
Recurrent Experience Replay in Distributed Reinforcement Learning
Agent57
412847.86
Agent57: Outperforming the Atari Human Benchmark
GDI-H3
195630
Generalized Data Distribution Iteration
GDI-I3
155830
GDI: Rethinking What Makes Reinforcement Learning Different From Supervised Learning
GDI-I3
155830
Generalized Data Distribution Iteration
Full Tree
125123
The Arcade Learning Environment: An Evaluation Platform for General Agents
DNA
100194
DNA: Proximal Policy Optimization with a Dual Network Architecture
DDQN+Pop-Art noop
49065.8
Learning values across many orders of magnitude
CGP
24708
Evolving simple programs for playing Atari games
Ape-X
12974
Distributed Prioritized Experience Replay
QR-DQN-1
12447
Distributional Reinforcement Learning with Quantile Regression
DreamerV2
11883
Mastering Atari with Discrete World Models
IQN
11561
Implicit Quantile Networks for Distributional Reinforcement Learning
IMPALA (deep)
11049.75
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
C51 noop
9646.0
A Distributional Perspective on Reinforcement Learning
Best Learner
8803.8
The Arcade Learning Environment: An Evaluation Platform for General Agents
0 of 45 row(s) selected.
Previous
Next
Atari Games On Atari 2600 Centipede | SOTA | HyperAI초신경