Question Answering On Bamboogle

Evaluation Metric

Accuracy

Evaluation Results

Performance of each model on this benchmark

Comparison Table

Model name                                      Accuracy
fireact-toward-language-agent-fine-tuning       44.0
answering-questions-by-meta-reasoning-over      66.5
measuring-and-narrowing-the-compositionality    57.6
rest-meets-react-self-improvement-for-multi     76.1
measuring-and-narrowing-the-compositionality    60.0
measuring-and-narrowing-the-compositionality    0
measuring-and-narrowing-the-compositionality    46.4
making-retrieval-augmented-language-models      62.7
measuring-and-narrowing-the-compositionality    17.6