Question Answering On Webquestions
Metriken
EM
Ergebnisse
Leistungsergebnisse verschiedener Modelle zu diesem Benchmark
Vergleichstabelle
Modellname | EM |
---|---|
palm-scaling-language-modeling-with-pathways-1 | 10.6 |
language-models-are-few-shot-learners | 41.5 |
palm-2-technical-report-1 | 21.8 |
chain-of-action-faithful-and-multimodal | 26.3 |
tree-of-thoughts-deliberate-problem-solving-1 | 26.3 |
react-synergizing-reasoning-and-acting-in | 38.3 |
chain-of-action-faithful-and-multimodal | 59.4 |
realm-retrieval-augmented-language-model-pre | 40.7 |
chain-of-thought-prompting-elicits-reasoning | 42.5 |
language-models-are-few-shot-learners | 14.4 |
chain-of-action-faithful-and-multimodal | 42.5 |
large-scale-simple-question-answering-with | - |
palm-2-technical-report-1 | 28.2 |
exploring-the-limits-of-transfer-learning | 42.8 |
palm-scaling-language-modeling-with-pathways-1 | 22.6 |
language-models-are-few-shot-learners | 25.3 |
fie-building-a-global-probability-space-by | 56.3 |
dense-passage-retrieval-for-open-domain | 42.4 |
measuring-and-narrowing-the-compositionality | 31.1 |
question-answering-with-subgraph-embeddings | - |
190600300 | 36.4 |
chain-of-action-faithful-and-multimodal | 31.1 |
language-models-are-few-shot-learners | 44.7 |
retrieval-augmented-generation-for-knowledge | 45.2 |
glam-efficient-scaling-of-language-models | 15.5 |
palm-scaling-language-modeling-with-pathways-1 | 43.5 |
fido-fusion-in-decoder-optimized-for-stronger | 51.1 |
chain-of-action-faithful-and-multimodal | 70.7 |
dspy-compiling-declarative-language-model | 59.4 |
chain-of-action-faithful-and-multimodal | 38.3 |
chain-of-action-faithful-and-multimodal | 43 |
chain-of-action-faithful-and-multimodal | 64.7 |
palm-2-technical-report-1 | 26.9 |
open-question-answering-with-weakly | - |
language-models-are-unsupervised-multitask | 43 |
chain-of-action-faithful-and-multimodal | 44.7 |
fie-building-a-global-probability-space-by | 52.4 |