Open-Domain Question Answering on KILT
Metrics
EM
F1
KILT-EM
KILT-F1
R-Prec
Recall@5
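As a rough illustration of what these metrics measure, the sketch below computes them for a single question. It assumes SQuAD-style answer normalization (lowercasing, stripping punctuation and English articles), treats the gold provenance as one flat set of page IDs, and takes the KILT-prefixed scores to be the answer score counted only when R-Prec equals 1. All function names are hypothetical, and the official KILT evaluation scripts may differ in detail (for example in how multiple gold answers or provenance sets are handled).

```python
import re
import string
from collections import Counter


def normalize(text: str) -> str:
    """Lowercase, drop punctuation and English articles, collapse whitespace."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction: str, gold: str) -> float:
    """EM: 1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize(prediction) == normalize(gold))


def token_f1(prediction: str, gold: str) -> float:
    """F1: harmonic mean of token-level precision and recall."""
    pred_tokens = normalize(prediction).split()
    gold_tokens = normalize(gold).split()
    overlap = sum((Counter(pred_tokens) & Counter(gold_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(gold_tokens)
    return 2 * precision * recall / (precision + recall)


def r_precision(ranked_pages: list, gold_pages: list) -> float:
    """R-Prec: fraction of the R gold pages found in the top R retrieved pages."""
    gold = set(gold_pages)
    if not gold:
        return 0.0
    top_r = ranked_pages[: len(gold)]
    return len(set(top_r) & gold) / len(gold)


def recall_at_k(ranked_pages: list, gold_pages: list, k: int = 5) -> float:
    """Recall@k: fraction of gold pages found in the top k retrieved pages."""
    gold = set(gold_pages)
    if not gold:
        return 0.0
    return len(set(ranked_pages[:k]) & gold) / len(gold)


def kilt_score(answer_score: float, ranked_pages: list, gold_pages: list) -> float:
    """KILT-EM / KILT-F1 (assumed): keep the answer score only if R-Prec is 1."""
    if r_precision(ranked_pages, gold_pages) == 1.0:
        return answer_score
    return 0.0
```

Benchmark-level numbers such as those in the table would then be averages of these per-question scores, scaled to percentages.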
Results
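Performance of the listed models on this benchmark. Scores are on a 0-100 scale. Where a row shows 0.0 for a whole group of metrics, the value most likely means the metric was not reported rather than a true score of zero.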
Comparison Table
Model Name | EM | F1 | KILT-EM | KILT-F1 | R-Prec | Recall@5 |
---|---|---|---|---|---|---|
Model 1 | 39.75 | 48.43 | 29.09 | 34.7 | 59.42 | 68.24 |
Model 2 | 46.05 | 56.57 | 0.0 | 0.0 | 0.0 | 0.0 |
Model 3 | 45.22 | 53.38 | 36.36 | 41.83 | 63.71 | 70.17 |
Model 4 | 53.74 | 62.24 | 38.78 | 44.4 | 63.16 | 68.19 |
Model 5 | 0.0 | 0.0 | 0.0 | 0.0 | 59.42 | 68.24 |
Model 6 | 41.27 | 49.54 | 30.06 | 34.72 | 54.29 | 65.52 |
Model 7 | 38.64 | 47.09 | 31.99 | 37.58 | 60.66 | 46.79 |
Model 8 | 0.35 | 3.72 | 0.0 | 0.0 | 0.0 | 0.0 |
re2g-retrieve-rerank-generate-2 | 51.73 | 60.97 | 43.56 | 49.8 | 70.78 | 76.63 |
Model 10 | 21.75 | 28.69 | 0.0 | 0.0 | 0.0 | 0.0 |
Model 11 | 51.59 | 60.83 | 35.32 | 40.73 | 59.83 | 71.17 |
Model 12 | 0.0 | 0.0 | 0.0 | 0.0 | 62.6 | 64.95 |
kilt-a-benchmark-for-knowledge-intensive | 19.6 | 27.73 | 0.0 | 0.0 | 0.0 | 0.0 |
Model 14 | 0.0 | 0.0 | 0.0 | 0.0 | 60.32 | 61.21 |
Model 15 | 0.0 | 0.0 | 0.0 | 0.0 | 60.25 | 61.36 |
Model 16 | 44.39 | 52.35 | 32.69 | 37.91 | 59.49 | 67.06 |