Question Answering On Multitq
評価指標
Hits@1
Hits@10
評価結果
このベンチマークにおける各モデルのパフォーマンス結果
比較表
モデル名 | Hits@1 | Hits@10 |
---|---|---|
question-answering-over-temporal-knowledge | 27.9 | 63.5 |
muti-view-mouse-social-behaviour-recognition | 29.3 | 44.1 |
distilbert-a-distilled-version-of-bert | 8.3 | 48.4 |
improving-multi-hop-question-answering-over | 20.6 | 60.8 |
bert-pre-training-of-deep-bidirectional | 8.3 | 48.2 |
モデル 6 | 10.2 | - |
self-improvement-programming-for-temporal | 79.7 | 91 |
llama-2-open-foundation-and-fine-tuned-chat | 18.5 | - |
temporal-knowledge-question-answering-via | 38 | - |
albert-a-lite-bert-for-self-supervised | 10.8 | 45.9 |
モデル 11 | 72.8 | - |