HyperAI

Natural Language Inference on CommitmentBank

Metrics

Accuracy

Results

Performance results of various models on this benchmark

Comparison table
Model name                                       Accuracy
designing-effective-sparse-expert-models         98
bloomberggpt-a-large-language-model-for          44.64
bloomberggpt-a-large-language-model-for          48.21
designing-effective-sparse-expert-models         98.2
n-grammer-augmenting-transformers-with-latent-1  67.9
language-models-are-few-shot-learners            75.6
palm-2-technical-report-1                        82.1
bloomberggpt-a-large-language-model-for          48.21
palm-2-technical-report-1                        80.4
language-models-are-few-shot-learners            -
palm-2-technical-report-1                        87.5
exploring-the-limits-of-transfer-learning        96.8
bloomberggpt-a-large-language-model-for          53.57
exploring-the-limits-of-transfer-learning        94.4
palm-scaling-language-modeling-with-pathways-1   100
toward-efficient-language-model-pretraining      97.6
toward-efficient-language-model-pretraining      99.2
alexatm-20b-few-shot-learning-using-a-large      67.9
deberta-decoding-enhanced-bert-with              97.2
exploring-the-limits-of-transfer-learning        94