Arithmetic Reasoning On Mathtof
評価指標
Accuracy
評価結果
このベンチマークにおける各モデルのパフォーマンス結果
モデル名 | Accuracy | Paper Title | Repository |
---|---|---|---|
GPT-4 (Teaching-Inspired) | 89.2 | Teaching-Inspired Integrated Prompting Framework: A Novel Approach for Enhancing Reasoning in Large Language Models | - |
0 of 1 row(s) selected.