HyperAI

Multi Task Language Understanding On Mgsm

Métriques

Average (%)

Résultats

Résultats de performance de divers modèles sur ce benchmark

Tableau comparatif
Nom du modèleAverage (%)
transcending-scaling-laws-with-0-1-extra49.9
palm-scaling-language-modeling-with-pathways-155.0
palm-2-technical-report-187.0
scaling-instruction-finetuned-language-models60.4
scaling-instruction-finetuned-language-models72.0
scaling-instruction-finetuned-language-models35
scaling-instruction-finetuned-language-models57.0
scaling-instruction-finetuned-language-models5.7
scaling-instruction-finetuned-language-models36
scaling-instruction-finetuned-language-models21.2
scaling-instruction-finetuned-language-models23.7
palm-2-technical-report-172.2