Question Answering on ConvFinQA
Evaluation Metric
Execution Accuracy
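Execution accuracy is commonly understood as the share of questions for which executing the model's predicted answer or program yields the gold answer. The following is a minimal sketch of that idea, not the benchmark's official scoring script: the function name `execution_accuracy`, the use of `None` for a failed execution, and the numeric tolerance are all assumptions made for illustration.

```python
import math


def execution_accuracy(predicted, gold, rel_tol=1e-4):
    """Return the percentage of examples whose executed result matches gold.

    `predicted` holds the numeric result of executing each prediction, or
    None when execution failed (hypothetical convention). `rel_tol` is an
    assumed tolerance; the official evaluation may compare differently.
    """
    if len(predicted) != len(gold):
        raise ValueError("predicted and gold must have the same length")
    if not gold:
        return 0.0

    correct = 0
    for p, g in zip(predicted, gold):
        # A failed execution (None) always counts as incorrect.
        if p is not None and math.isclose(p, g, rel_tol=rel_tol, abs_tol=1e-9):
            correct += 1
    return 100.0 * correct / len(gold)


if __name__ == "__main__":
    # Two of four toy predictions match their gold answers.
    print(execution_accuracy([1.0, 2.0, None, 4.0], [1.0, 2.5, 3.0, 4.0]))
```

Scores in the table below are on this 0 to 100 scale.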
Results
Performance of each model on this benchmark
Model | Execution Accuracy (%) | Paper Title | Repository |
---|---|---|---|
General Crowd | 46.90 | Are ChatGPT and GPT-4 General-Purpose Solvers for Financial Text Analytics? A Study on Several Typical Tasks | - |
FinQANet (RoBERTa-large) | 68.9 | ConvFinQA: Exploring the Chain of Numerical Reasoning in Conversational Finance Question Answering | |
GPT-4 (8k) | 76.48 | Are ChatGPT and GPT-4 General-Purpose Solvers for Financial Text Analytics? A Study on Several Typical Tasks | - |