HyperAI
Atari Games On Atari 2600 Beam Rider
Evaluation metric: Score

Evaluation results
Performance of each model on this benchmark.
| Model name | Score | Paper title |
| --- | --- | --- |
| MuZero | 454993.53 | Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model |
| GDI-H3 | 422890 | Generalized Data Distribution Iteration |
| MuZero (Res2 Adam) | 333077.44 | Online and Offline Reinforcement Learning by Planning with a Learned Model |
| Agent57 | 300509.8 | Agent57: Outperforming the Atari Human Benchmark |
| R2D2 | 188257.4 | Recurrent Experience Replay in Distributed Reinforcement Learning |
| GDI-I3 | 162100 | Generalized Data Distribution Iteration |
| GDI-I3 | 162100 | GDI: Rethinking What Makes Reinforcement Learning Different From Supervised Learning |
| Ape-X | 63305.2 | Distributed Prioritized Experience Replay |
| IQN | 42776 | Implicit Quantile Networks for Distributional Reinforcement Learning |
| Prior+Duel hs | 37412.2 | Deep Reinforcement Learning with Double Q-learning |
| QR-DQN-1 | 34821 | Distributional Reinforcement Learning with Quantile Regression |
| IMPALA (deep) | 32463.47 | IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures |
| Prior hs | 31181.3 | Prioritized Experience Replay |
| Prior+Duel noop | 30276.5 | Dueling Network Architectures for Deep Reinforcement Learning |
| ASL DDQN | 26841.6 | Train a Real-world Local Path Planner in One Hour via Partially Decoupled Reinforcement Learning and Vectorized Diversity |
| A3C LSTM hs | 24622.2 | Asynchronous Methods for Deep Reinforcement Learning |
| Bootstrapped DQN | 23429.8 | Deep Exploration via Bootstrapped DQN |
| Prior noop | 23384.2 | Prioritized Experience Replay |
| NoisyNet-Dueling | 23134 | Noisy Networks for Exploration |
| A3C FF hs | 22707.9 | Asynchronous Methods for Deep Reinforcement Learning |