HyperAI

Math Word Problem Solving On Svamp 1 N

Metrics

Execution Accuracy

Results

Performance results of various models on this benchmark

Comparison Table
Model NameExecution Accuracy
athena-mathematical-reasoning-with-thought67.8
athena-mathematical-reasoning-with-thought52.5