Question Answering on QuAC
Metrics
F1
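For readers unfamiliar with the metric, the following is a minimal sketch of a token-overlap F1 between a predicted answer and a single reference answer. It is an illustration only: the function name `token_f1` and the whitespace tokenization are assumptions, and the official QuAC evaluation may differ in normalization, handling of multiple references, and unanswerable questions.

```python
from collections import Counter


def token_f1(prediction: str, reference: str) -> float:
    """Token-overlap F1 between a predicted and a reference answer.

    Hypothetical helper for illustration; not the official QuAC scorer.
    """
    pred_tokens = prediction.lower().split()
    ref_tokens = reference.lower().split()
    if not pred_tokens or not ref_tokens:
        # Two empty answers count as a match; exactly one empty is a miss.
        return float(pred_tokens == ref_tokens)
    # Multiset intersection: each shared token counts at most as often
    # as it appears in both answers.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)
```

A leaderboard score such as those reported here would typically be this kind of per-question value averaged over the evaluation set and scaled to 0-100, though the exact aggregation is defined by the benchmark's own scorer.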
Results
Performance results of various models on this benchmark
Model | F1 | Paper Title | Repository |
---|---|---|---|
GPT-3 175B (few-shot, k=32) | 44.3 | Language Models are Few-Shot Learners | |
FlowQA (single model) | 64.1 | FlowQA: Grasping Flow in History for Conversational Machine Comprehension | |