HyperAI超神経

Math Word Problem Solving on MAWPS

Evaluation Metric

Accuracy (%)

Evaluation Results

Performance of each model on this benchmark

Comparison Table
Model                                           Accuracy (%)
Model 1                                         9.3
are-nlp-models-really-able-to-solve-simple      88.5
athena-mathematical-reasoning-with-thought      92.2
graph-to-tree-learning-for-solving-math-word    83.7
are-nlp-models-really-able-to-solve-simple      88.7
learning-multi-step-reasoning-from-arithmetic   94.3
math-word-problem-solving-by-generating         80.3
math-word-problem-solving-by-generating         91.0
ept-x-an-expression-pointer-transformer-model   88.7
an-expression-tree-decoding-strategy-for        92.3
athena-mathematical-reasoning-with-thought      93
ept-x-an-expression-pointer-transformer-model   84.57
Model 13                                        7.9
Model 14                                        19.8
openmathinstruct-1-a-1-8-million-math           95.7
math-word-problem-solving-by-generating         9.9
point-to-the-expression-solving-algebraic       84.51
learning-to-reason-deductively-math-word        92
math-word-problem-solving-by-generating         2.76
Model 20                                        44.0
multi-view-reasoning-consistent-contrastive     92.3
math-word-problem-solving-by-generating         4.09
llama-2-open-foundation-and-fine-tuned-chat     82.4
Model 24                                        15.0
generating-equation-by-utilizing-operators      85.1