HyperAI초신경

Question Answering On Conditionalqa

평가 지표

Conditional (answers)
Conditional (w/ conditions)
Overall (answers)
Overall (w/ conditions)

평가 결과

이 벤치마크에서 각 모델의 성능 결과

비교 표
모델 이름Conditional (answers)Conditional (w/ conditions)Overall (answers)Overall (w/ conditions)
etc-encoding-long-and-structured-data-in39.4 / 41.82.5 / 3.435.6 / 39.826.9 / 30.8
leveraging-passage-retrieval-with-generative45.2 / 49.74.7 / 5.844.4 / 50.835.0 / 40.6
end-to-end-multihop-retrieval-for42.0 / 46.43.1 / 3.840.6 / 45.231.9 / 36.0