Atari Games On Atari 57
評価指標
Human World Record Breakthrough
Mean Human Normalized Score
評価結果
このベンチマークにおける各モデルのパフォーマンス結果
モデル名 | Human World Record Breakthrough | Mean Human Normalized Score | Paper Title | Repository |
---|---|---|---|---|
LASER | 7 | 1741.36% | Off-Policy Actor-Critic with Shared Experience Replay | - |
GDI-H3(200M frames) | 22 | 9620.98% | GDI: Rethinking What Makes Reinforcement Learning Different From Supervised Learning | - |
R2D2 | 15 | 3374.31% | Recurrent Experience Replay in Distributed Reinforcement Learning | - |
LBC | 24 | 10077.52% | Learnable Behavior Control: Breaking Atari Human World Records via Sample-Efficient Behavior Selection | - |
IMPALA, deep | 3 | 957.34% | IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures | |
GDI-H3 | - | - | GDI: Rethinking What Makes Reinforcement Learning Different From Supervised Learning | - |
GDI-H3 | 22 | 9620.33% | Generalized Data Distribution Iteration | - |
GDI-I3 | 17 | 7810.1% | Generalized Data Distribution Iteration | - |
Rainbow DQN | 4 | 873.97% | Rainbow: Combining Improvements in Deep Reinforcement Learning | |
MuZero | 19 | 4996.20% | Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model | |
M-IQN | - | 504% | Munchausen Reinforcement Learning |
0 of 11 row(s) selected.