Open Domain Question Answering on KILT
Metrics
EM
F1
KILT-EM
KILT-F1
R-Prec
Recall@5
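The page lists EM and F1 without defining them. As a rough illustration, the sketch below computes exact match and token-level F1 in the common SQuAD style. This is an assumption: the normalization rules (lowercasing, stripping punctuation and English articles) and the function names `normalize`, `exact_match`, and `f1_score` are illustrative, and the benchmark's official scorer may differ in details. The KILT-prefixed variants and the retrieval metrics (R-Prec, Recall@5) are not covered here.

```python
import re
import string
from collections import Counter


def normalize(text: str) -> str:
    """Lowercase, drop punctuation and English articles, collapse whitespace.

    Assumed SQuAD-style normalization; not confirmed by the source page.
    """
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction: str, gold: str) -> float:
    """Return 1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize(prediction) == normalize(gold))


def f1_score(prediction: str, gold: str) -> float:
    """Harmonic mean of token-level precision and recall."""
    pred_tokens = normalize(prediction).split()
    gold_tokens = normalize(gold).split()
    if not pred_tokens or not gold_tokens:
        # Both empty counts as a match; exactly one empty scores zero.
        return float(pred_tokens == gold_tokens)
    # Multiset intersection counts each shared token at most as often
    # as it appears in both answers.
    overlap = sum((Counter(pred_tokens) & Counter(gold_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(gold_tokens)
    return 2 * precision * recall / (precision + recall)
```

Leaderboard values such as those in the table are typically these per-question scores averaged over the test set and multiplied by 100.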
Results
Performance results of various models on this benchmark
Comparison table
Model name | EM | F1 | KILT-EM | KILT-F1 | R-Prec | Recall@5 |
---|---|---|---|---|---|---|
Model 1 | 39.75 | 48.43 | 29.09 | 34.7 | 59.42 | 68.24 |
Model 2 | 46.05 | 56.57 | 0.0 | 0.0 | 0.0 | 0.0 |
Model 3 | 45.22 | 53.38 | 36.36 | 41.83 | 63.71 | 70.17 |
Model 4 | 53.74 | 62.24 | 38.78 | 44.4 | 63.16 | 68.19 |
Model 5 | 0.0 | 0.0 | 0.0 | 0.0 | 59.42 | 68.24 |
Model 6 | 41.27 | 49.54 | 30.06 | 34.72 | 54.29 | 65.52 |
Model 7 | 38.64 | 47.09 | 31.99 | 37.58 | 60.66 | 46.79 |
Model 8 | 0.35 | 3.72 | 0.0 | 0.0 | 0.0 | 0.0 |
re2g-retrieve-rerank-generate-2 | 51.73 | 60.97 | 43.56 | 49.8 | 70.78 | 76.63 |
Model 10 | 21.75 | 28.69 | 0.0 | 0.0 | 0.0 | 0.0 |
Model 11 | 51.59 | 60.83 | 35.32 | 40.73 | 59.83 | 71.17 |
Model 12 | 0.0 | 0.0 | 0.0 | 0.0 | 62.6 | 64.95 |
kilt-a-benchmark-for-knowledge-intensive | 19.6 | 27.73 | 0.0 | 0.0 | 0.0 | 0.0 |
Model 14 | 0.0 | 0.0 | 0.0 | 0.0 | 60.32 | 61.21 |
Model 15 | 0.0 | 0.0 | 0.0 | 0.0 | 60.25 | 61.36 |
Model 16 | 44.39 | 52.35 | 32.69 | 37.91 | 59.49 | 67.06 |