Question Answering On Finqa
평가 지표
Execution Accuracy
Program Accuracy
평가 결과
이 벤치마크에서 각 모델의 성능 결과
비교 표
모델 이름 | Execution Accuracy | Program Accuracy |
---|---|---|
elastic-numerical-reasoning-with-adaptive | 68.96 | 65.21 |
finqa-a-dataset-of-numerical-reasoning-over | 57.43 | 55.52 |
finqa-a-dataset-of-numerical-reasoning-over | 65.05 | 63.52 |
are-chatgpt-and-gpt-4-general-purpose-solvers | 68.79 | - |
apollo-an-optimized-training-approach-for | 71.07 | 68.94 |
finqa-a-dataset-of-numerical-reasoning-over | 53.71 | 51.71 |