Math Word Problem Solving on SVAMP 1 N

Evaluation Metric

Execution Accuracy

Evaluation Results

Performance of each model on this benchmark

| Model | Execution Accuracy | Paper Title | Repository |
|---|---|---|---|
| ATHENA (roberta-large) | 67.8 | ATHENA: Mathematical Reasoning with Thought Expansion | - |
| ATHENA (roberta-base) | 52.5 | ATHENA: Mathematical Reasoning with Thought Expansion | - |