HyperAI超神経

Multi-Task Language Understanding on MGSM

Evaluation Metrics

Average (%)

Evaluation Results

Performance results of each model on this benchmark

Comparison Table
Model name                                      Average (%)
transcending-scaling-laws-with-0-1-extra        49.9
palm-scaling-language-modeling-with-pathways-1  55.0
palm-2-technical-report-1                       87.0
scaling-instruction-finetuned-language-models   60.4
scaling-instruction-finetuned-language-models   72.0
scaling-instruction-finetuned-language-models   35
scaling-instruction-finetuned-language-models   57.0
scaling-instruction-finetuned-language-models   5.7
scaling-instruction-finetuned-language-models   36
scaling-instruction-finetuned-language-models   21.2
scaling-instruction-finetuned-language-models   23.7
palm-2-technical-report-1                       72.2