2048 On 2048
评估指标
Average Score
评测结果
各个模型在此基准测试上的表现结果
模型名称 | Average Score | Paper Title | Repository |
---|---|---|---|
AlphaZero (With Simulator) | 500000 | Planning in Stochastic Environments with a Learned Model | |
MuZero | 300000 | Planning in Stochastic Environments with a Learned Model | |
Beam Search | 1024 | Playing 2048 With Reinforcement Learning | |
DQN (1000 episodes) | 256 | Playing 2048 With Reinforcement Learning | |
Stochastic Muzero | 500000 | Planning in Stochastic Environments with a Learned Model |
0 of 5 row(s) selected.