Command Palette
Search for a command to run...
Continuous Control On Cartpole Swingup 2
評価指標
Return
評価結果
このベンチマークにおける各モデルのパフォーマンス結果
| Paper Title | ||
|---|---|---|
| SMuZero | 868.87 | Learning and Planning in Complex Action Spaces |
| MuZero Unplugged | 594.3 | Online and Offline Reinforcement Learning by Planning with a Learned Model |
0 of 2 row(s) selected.