HyperAIHyperAI

Command Palette

Search for a command to run...

Math Word Problem Solving On Svamp 1 N

Métriques

Execution Accuracy

Résultats

Résultats de performance de divers modèles sur ce benchmark

Paper Title
ATHENA (roberta-large)67.8ATHENA: Mathematical Reasoning with Thought Expansion
ATHENA (roberta-base)52.5ATHENA: Mathematical Reasoning with Thought Expansion
0 of 2 row(s) selected.