HyperAI

Question Answering on FinQA

Metrics

Execution Accuracy
Program Accuracy

Results

Performance results of various models on this benchmark

Comparison table

Model name                                      Execution Accuracy   Program Accuracy
elastic-numerical-reasoning-with-adaptive       68.96                65.21
finqa-a-dataset-of-numerical-reasoning-over     57.43                55.52
finqa-a-dataset-of-numerical-reasoning-over     65.05                63.52
are-chatgpt-and-gpt-4-general-purpose-solvers   68.79                -
apollo-an-optimized-training-approach-for       71.07                68.94
finqa-a-dataset-of-numerical-reasoning-over     53.71                51.71