HyperAI超神経

Question Answering On Obqa

評価指標

Accuracy

評価結果

このベンチマークにおける各モデルのパフォーマンス結果

比較表
モデル名Accuracy
llama-open-and-efficient-foundation-language-157.2
llama-open-and-efficient-foundation-language-156.4
finetuned-language-models-are-zero-shot78.2
palm-scaling-language-modeling-with-pathways-153.4
finetuned-language-models-are-zero-shot78.4
palm-scaling-language-modeling-with-pathways-150.4
language-models-are-few-shot-learners57.6
llama-open-and-efficient-foundation-language-160.2
llama-open-and-efficient-foundation-language-158.6