Fact Verification On Kilt Fever
Metrics
Accuracy
KILT-AC
R-Prec
Recall@5
Results
Performance results of various models on this benchmark
Comparison Table
Model Name | Accuracy | KILT-AC | R-Prec | Recall@5 |
---|---|---|---|---|
Model 1 | 85.58 | 64.41 | 75.6 | 84.95 |
Model 2 | 89.54 | 71.28 | 81.45 | 89.56 |
Model 3 | 66.1 | 41.88 | 49.24 | 70.16 |
Model 4 | 69.41 | 0.0 | 0.0 | 0.0 |
Model 5 | 70.71 | 0.0 | 0.0 | 0.0 |
Model 6 | 12.57 | 0.0 | 0.0 | 0.0 |
Model 7 | 78.93 | 0.0 | 0.0 | 0.0 |
Model 8 | 71.42 | 0.0 | 0.0 | 0.0 |
Model 9 | 88.99 | 65.68 | 74.77 | 87.89 |
Model 10 | 72.34 | 0.0 | 0.0 | 0.0 |
kilt-a-benchmark-for-knowledge-intensive | 76.3 | 0.0 | 0.0 | 0.0 |
kilt-a-benchmark-for-knowledge-intensive | 86.31 | 53.45 | 61.94 | 75.55 |
Model 13 | 71.38 | 0.0 | 0.0 | 0.0 |
Model 14 | 69.68 | 58.58 | 72.93 | 73.52 |
re2g-retrieve-rerank-generate-2 | 89.55 | 78.53 | 88.92 | 92.52 |
Model 16 | 23.01 | 0.0 | 0.0 | 0.0 |
Model 17 | 86.74 | 47.68 | 55.33 | 74.29 |
Model 18 | 71.12 | 0.0 | 0.0 | 0.0 |
Model 19 | 86.32 | 63.94 | 74.48 | 87.52 |
Model 20 | 0.0 | 0.0 | 74.48 | 87.52 |
Model 21 | 0.0 | 0.0 | 83.64 | 88.15 |
Model 22 | 88.45 | 0.0 | 0.0 | 0.0 |
Model 23 | 61.6 | 0.0 | 0.0 | 0.0 |
Model 24 | 71.24 | 0.0 | 0.0 | 0.0 |
Model 25 | 71.58 | 0.0 | 0.0 | 0.0 |
Model 26 | 67.98 | 0.0 | 0.0 | 0.0 |
Model 27 | 68.43 | 0.0 | 0.0 | 0.0 |
Model 28 | 0.0 | 0.0 | 84.45 | 88.62 |
Model 29 | 33.58 | 0.0 | 0.0 | 0.0 |
Model 30 | 69.71 | 0.0 | 0.0 | 0.0 |
Model 31 | 89.12 | 0.0 | 0.0 | 0.0 |
Model 32 | 76.26 | 0.0 | 0.0 | 0.0 |
Model 33 | 0.0 | 0.0 | 84.07 | 89.41 |