Question Answering on ConvFinQA
Metrics
Execution Accuracy
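As a rough illustration of the metric, the sketch below assumes that execution accuracy is the percentage of questions for which the numeric result of the model's predicted answer (or executed program) matches the gold answer. The function name `execution_accuracy`, the tolerance values, and the use of `None` for a failed prediction are assumptions made for this example. They are not taken from the benchmark's official evaluation script.

```python
import math


def execution_accuracy(predicted, gold, rel_tol=1e-4):
    """Return the percentage of predictions that match the gold answers.

    `predicted` may contain None where a model produced no executable answer;
    such entries count as incorrect. Tolerances are illustrative assumptions.
    """
    if len(predicted) != len(gold):
        raise ValueError("predicted and gold must have the same length")
    if not gold:
        return 0.0
    correct = sum(
        1
        for p, g in zip(predicted, gold)
        if p is not None and math.isclose(p, g, rel_tol=rel_tol, abs_tol=1e-9)
    )
    return 100.0 * correct / len(gold)


# Two of the four hypothetical predictions match their gold answers.
score = execution_accuracy([1.0, 2.0, None, 4.0], [1.0, 2.5, 3.0, 4.0])
print(score)
```

Comparing with a tolerance rather than strict equality is a design choice here, since financial answers are often rounded; an official scorer may round or normalize differently.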
Results
Execution accuracy reported for various models on the ConvFinQA benchmark.
Comparison Table
Model Name | Execution Accuracy (%) |
---|---|
are-chatgpt-and-gpt-4-general-purpose-solvers | 46.90 |
convfinqa-exploring-the-chain-of-numerical | 68.9 |
are-chatgpt-and-gpt-4-general-purpose-solvers | 76.48 |