HyperAI超神経

Math Word Problem Solving On Mawps

評価指標

Accuracy (%)

評価結果

このベンチマークにおける各モデルのパフォーマンス結果

モデル名
Accuracy (%)
Paper TitleRepository
GPT-J + CC9.3--
GTS with RoBERTa88.5Are NLP Models really able to Solve Simple Math Word Problems?
ATHENA (roberta-base)92.2ATHENA: Mathematical Reasoning with Thought Expansion
Graph2Tree83.7Graph-to-Tree Learning for Solving Math Word Problems
Graph2Tree with RoBERTa88.7Are NLP Models really able to Solve Simple Math Word Problems?
MsAT-DeductReasoner94.3Learning Multi-Step Reasoning by Solving Arithmetic Tasks
GPT-3.5 turbo (175B)80.3Math Word Problem Solving by Generating Linguistic Variants of Problem Statements
DeBERTa (PM + VM)91.0Math Word Problem Solving by Generating Linguistic Variants of Problem Statements
EPT88.7EPT-X: An Expression-Pointer Transformer model that generates eXplanations for numbers
Exp-Tree92.3An Expression Tree Decoding Strategy for Mathematical Equation Generation
ATHENA (roberta-large)93ATHENA: Mathematical Reasoning with Thought Expansion
EPT-X84.57EPT-X: An Expression-Pointer Transformer model that generates eXplanations for numbers
OPT (66B)7.9--
GPT-3 (175B)19.8--
OpenMath-CodeLlama-70B (w/ code)95.7OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset
GPT-J9.9Math Word Problem Solving by Generating Linguistic Variants of Problem Statements
EPT84.51Point to the Expression: Solving Algebraic Word Problems using the Expression-Pointer Transformer Model
Roberta-DeductReasoner92Learning to Reason Deductively: Math Word Problem Solving as Complex Relation Extraction
GPT-3 text-babbage-001 (6.7B)2.76Math Word Problem Solving by Generating Linguistic Variants of Problem Statements
Toolformer44.0--
0 of 25 row(s) selected.