Command Palette

Search for a command to run...

Arithmetic Reasoning On Game Of 24

評価指標

Success

評価結果

このベンチマークにおける各モデルのパフォーマンス結果

Paper Title
Tree of Thoughts (b=5)0.74Tree of Thoughts: Deliberate Problem Solving with Large Language Models
0 of 1 row(s) selected.
Arithmetic Reasoning On Game Of 24 | SOTA | HyperAI超神経