HyperAI

Question Answering On Bamboogle

Metrics

Accuracy

Results

Performance results of various models on this benchmark

Comparison Table
Model Name                                    Accuracy
fireact-toward-language-agent-fine-tuning     44.0
answering-questions-by-meta-reasoning-over    66.5
measuring-and-narrowing-the-compositionality  57.6
rest-meets-react-self-improvement-for-multi   76.1
measuring-and-narrowing-the-compositionality  60.0
measuring-and-narrowing-the-compositionality  0
measuring-and-narrowing-the-compositionality  46.4
making-retrieval-augmented-language-models    62.7
measuring-and-narrowing-the-compositionality  17.6