Question Answering On Strategyqa
評価指標
Accuracy
評価結果
このベンチマークにおける各モデルのパフォーマンス結果
比較表
モデル名 | Accuracy |
---|---|
rethinking-with-retrieval-faithful-large | 77.73 |
chain-of-action-faithful-and-multimodal | - |
transcending-scaling-laws-with-0-1-extra | 76.4 |
least-to-most-prompting-enables-complex | - |
モデル 5 | 77.2 |
search-in-the-chain-towards-the-accurate | - |
chain-of-action-faithful-and-multimodal | - |
chain-of-action-faithful-and-multimodal | - |
chain-of-action-faithful-and-multimodal | - |
transcending-scaling-laws-with-0-1-extra | 76.6 |
transcending-scaling-laws-with-0-1-extra | 61.9 |
palm-2-technical-report-1 | 90.4 |