Question Answering On Strategyqa
평가 지표
Accuracy
평가 결과
이 벤치마크에서 각 모델의 성능 결과
비교 표
모델 이름 | Accuracy |
---|---|
rethinking-with-retrieval-faithful-large | 77.73 |
chain-of-action-faithful-and-multimodal | - |
transcending-scaling-laws-with-0-1-extra | 76.4 |
least-to-most-prompting-enables-complex | - |
모델 5 | 77.2 |
search-in-the-chain-towards-the-accurate | - |
chain-of-action-faithful-and-multimodal | - |
chain-of-action-faithful-and-multimodal | - |
chain-of-action-faithful-and-multimodal | - |
transcending-scaling-laws-with-0-1-extra | 76.6 |
transcending-scaling-laws-with-0-1-extra | 61.9 |
palm-2-technical-report-1 | 90.4 |