HyperAI초신경

Question Answering on FinQA

Evaluation Metrics

Execution Accuracy
Program Accuracy

Evaluation Results

Performance results of each model on this benchmark

Comparison Table
Model Name                                      Execution Accuracy   Program Accuracy
elastic-numerical-reasoning-with-adaptive       68.96                65.21
finqa-a-dataset-of-numerical-reasoning-over     57.43                55.52
finqa-a-dataset-of-numerical-reasoning-over     65.05                63.52
are-chatgpt-and-gpt-4-general-purpose-solvers   68.79                -
apollo-an-optimized-training-approach-for       71.07                68.94
finqa-a-dataset-of-numerical-reasoning-over     53.71                51.71