Question Answering On Fever

평가 결과

이 벤치마크에서 각 모델의 성능 결과

		Paper Title	Repository
CoA	68.9	Chain-of-Action: Faithful and Multimodal Question Answering through Large Language Models
Self-Ask	64.2	Measuring and Narrowing the Compositionality Gap in Language Models
Self-Ask	64.2	Chain-of-Action: Faithful and Multimodal Question Answering through Large Language Models
DSP	62.2	DSPy: Compiling Declarative Language Model Calls into Self-Improving Pipelines
DSP	62.2	Chain-of-Action: Faithful and Multimodal Question Answering through Large Language Models
CoA w/o actions	54.2	Chain-of-Action: Faithful and Multimodal Question Answering through Large Language Models
Zero-shot	50	Language Models are Unsupervised Multitask Learners	-
Zero-shot	50	Chain-of-Action: Faithful and Multimodal Question Answering through Large Language Models

0 of 8 row(s) selected.