HyperAI

Question Answering On Webquestions

Metriken

EM

Ergebnisse

Leistungsergebnisse verschiedener Modelle zu diesem Benchmark

Vergleichstabelle
ModellnameEM
palm-scaling-language-modeling-with-pathways-110.6
language-models-are-few-shot-learners41.5
palm-2-technical-report-121.8
chain-of-action-faithful-and-multimodal26.3
tree-of-thoughts-deliberate-problem-solving-126.3
react-synergizing-reasoning-and-acting-in38.3
chain-of-action-faithful-and-multimodal59.4
realm-retrieval-augmented-language-model-pre40.7
chain-of-thought-prompting-elicits-reasoning42.5
language-models-are-few-shot-learners14.4
chain-of-action-faithful-and-multimodal42.5
large-scale-simple-question-answering-with-
palm-2-technical-report-128.2
exploring-the-limits-of-transfer-learning42.8
palm-scaling-language-modeling-with-pathways-122.6
language-models-are-few-shot-learners25.3
fie-building-a-global-probability-space-by56.3
dense-passage-retrieval-for-open-domain42.4
measuring-and-narrowing-the-compositionality31.1
question-answering-with-subgraph-embeddings-
19060030036.4
chain-of-action-faithful-and-multimodal31.1
language-models-are-few-shot-learners44.7
retrieval-augmented-generation-for-knowledge45.2
glam-efficient-scaling-of-language-models15.5
palm-scaling-language-modeling-with-pathways-143.5
fido-fusion-in-decoder-optimized-for-stronger51.1
chain-of-action-faithful-and-multimodal70.7
dspy-compiling-declarative-language-model59.4
chain-of-action-faithful-and-multimodal38.3
chain-of-action-faithful-and-multimodal43
chain-of-action-faithful-and-multimodal64.7
palm-2-technical-report-126.9
open-question-answering-with-weakly-
language-models-are-unsupervised-multitask43
chain-of-action-faithful-and-multimodal44.7
fie-building-a-global-probability-space-by52.4