HyperAI超神経

Question Answering On Conditionalqa

評価指標

Conditional (answers)
Conditional (w/ conditions)
Overall (answers)
Overall (w/ conditions)

評価結果

このベンチマークにおける各モデルのパフォーマンス結果

比較表
モデル名Conditional (answers)Conditional (w/ conditions)Overall (answers)Overall (w/ conditions)
etc-encoding-long-and-structured-data-in39.4 / 41.82.5 / 3.435.6 / 39.826.9 / 30.8
leveraging-passage-retrieval-with-generative45.2 / 49.74.7 / 5.844.4 / 50.835.0 / 40.6
end-to-end-multihop-retrieval-for42.0 / 46.43.1 / 3.840.6 / 45.231.9 / 36.0