HyperAI超神経

Question Answering on MultiRC

Evaluation Metrics

EM

Evaluation Results

Performance results of each model on this benchmark

Comparison Table
Model Name                                        EM
hungry-hungry-hippos-towards-language             48.9
finetuned-language-models-are-zero-shot           -
deberta-decoding-enhanced-bert-with               63.7
bloomberggpt-a-large-language-model-for           -
exploring-the-limits-of-transfer-learning         -
bloomberggpt-a-large-language-model-for           -
exploring-the-limits-of-transfer-learning         63.3
bloomberggpt-a-large-language-model-for           -
kelm-knowledge-enhanced-pre-trained-language      27.2
hungry-hungry-hippos-towards-language             59.5
palm-2-technical-report-1                         -
designing-effective-sparse-expert-models          -
bert-pre-training-of-deep-bidirectional           24.1
language-models-are-few-shot-learners             -
hungry-hungry-hippos-towards-language             59.7
palm-scaling-language-modeling-with-pathways-1    69.2
ask-me-anything-a-simple-strategy-for             -
hungry-hungry-hippos-towards-language             51.4
toward-efficient-language-model-pretraining       63
ask-me-anything-a-simple-strategy-for             -
ask-me-anything-a-simple-strategy-for             -
palm-2-technical-report-1                         -
finetuned-language-models-are-zero-shot           -
n-grammer-augmenting-transformers-with-latent-1   11.3
designing-effective-sparse-expert-models          -
alexatm-20b-few-shot-learning-using-a-large       -
bloomberggpt-a-large-language-model-for           -
toward-efficient-language-model-pretraining       62.4
palm-2-technical-report-1                         -
finetuned-language-models-are-zero-shot           -