HyperAI초신경

Question Answering On Bamboogle

평가 지표

Accuracy

평가 결과

이 벤치마크에서 각 모델의 성능 결과

비교 표
모델 이름Accuracy
fireact-toward-language-agent-fine-tuning44.0
answering-questions-by-meta-reasoning-over66.5
measuring-and-narrowing-the-compositionality57.6
rest-meets-react-self-improvement-for-multi76.1
measuring-and-narrowing-the-compositionality60.0
measuring-and-narrowing-the-compositionality0
measuring-and-narrowing-the-compositionality46.4
making-retrieval-augmented-language-models62.7
measuring-and-narrowing-the-compositionality17.6