HyperAI

Question Answering On Finqa

Métriques

Execution Accuracy
Program Accuracy

Résultats

Résultats de performance de divers modèles sur ce benchmark

Tableau comparatif
Nom du modèleExecution AccuracyProgram Accuracy
elastic-numerical-reasoning-with-adaptive68.9665.21
finqa-a-dataset-of-numerical-reasoning-over57.4355.52
finqa-a-dataset-of-numerical-reasoning-over65.0563.52
are-chatgpt-and-gpt-4-general-purpose-solvers68.79-
apollo-an-optimized-training-approach-for71.0768.94
finqa-a-dataset-of-numerical-reasoning-over53.7151.71