HyperAI초신경

Atari Games On Atari 2600 Zaxxon

평가 지표

Score

평가 결과

이 벤치마크에서 각 모델의 성능 결과

모델 이름
Score
Paper TitleRepository
Best Learner3365.1The Arcade Learning Environment: An Evaluation Platform for General Agents
DQN hs4412.0Deep Reinforcement Learning with Double Q-learning
DreamerV250699Mastering Atari with Discrete World Models
NoisyNet-Dueling14874Noisy Networks for Exploration
IQN21772Implicit Quantile Networks for Distributional Reinforcement Learning
RIMs-PPO15000Recurrent Independent Mechanisms
DDQN+Pop-Art noop14402.0Learning values across many orders of magnitude-
DNA22588DNA: Proximal Policy Optimization with a Dual Network Architecture
QR-DQN-113112Distributional Reinforcement Learning with Quantile Regression
Prior+Duel noop13886.0Dueling Network Architectures for Deep Reinforcement Learning
Agent57249808.9Agent57: Outperforming the Atari Human Benchmark
GDI-H3216020Generalized Data Distribution Iteration-
MuZero (Res2 Adam)154131.86Online and Offline Reinforcement Learning by Planning with a Learned Model
GDI-I3109140Generalized Data Distribution Iteration-
IMPALA (deep)32935.50IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
Gorila6159.4Massively Parallel Methods for Deep Reinforcement Learning
Prior hs9474.0Prioritized Experience Replay
A3C FF (1 day) hs2659.0Asynchronous Methods for Deep Reinforcement Learning
POP3D9472Policy Optimization With Penalized Point Probability Distance: An Alternative To Proximal Policy Optimization
Advantage Learning9129.61Increasing the Action Gap: New Operators for Reinforcement Learning
0 of 41 row(s) selected.