Question Answering On Strategyqa
Metriken
Accuracy
Ergebnisse
Leistungsergebnisse verschiedener Modelle zu diesem Benchmark
Vergleichstabelle
Modellname | Accuracy |
---|---|
rethinking-with-retrieval-faithful-large | 77.73 |
chain-of-action-faithful-and-multimodal | - |
transcending-scaling-laws-with-0-1-extra | 76.4 |
least-to-most-prompting-enables-complex | - |
Modell 5 | 77.2 |
search-in-the-chain-towards-the-accurate | - |
chain-of-action-faithful-and-multimodal | - |
chain-of-action-faithful-and-multimodal | - |
chain-of-action-faithful-and-multimodal | - |
transcending-scaling-laws-with-0-1-extra | 76.6 |
transcending-scaling-laws-with-0-1-extra | 61.9 |
palm-2-technical-report-1 | 90.4 |