HyperAI

Language Modelling on LAMBADA

Metrics

Accuracy

Results

Performance results of various models on this benchmark

Comparison table
Model name | Accuracy
massive-language-models-can-be-accurately | 0.02
all-nlp-tasks-are-generation-tasks-a-general | 72.35
language-models-are-few-shot-learners | 86.4
test-time-training-for-out-of-distribution-10.01
using-deepspeed-and-megatron-to-train: Megatron-Turing NLG 530B (Few-Shot) | (not listed)
palm-2-technical-report-1 | 83.7
language-models-are-few-shot-learners | 72.5
broad-context-language-modeling-as-reading | 49.0
language-models-are-unsupervised-multitask | 63.24
language-models-are-few-shot-learners | 67.1
pythia-a-suite-for-analyzing-large-language- | (not listed)
palm-2-technical-report-1 | 86.9
stay-on-topic-with-classifier-free-guidance | 83.9
universal-transformers | 56.25
massive-language-models-can-be-accurately | 79.47
massive-language-models-can-be-accurately | 76.51
palm-scaling-language-modeling-with-pathways-1 | 77.9
pythia-a-suite-for-analyzing-large-language | 67.28
glam-efficient-scaling-of-language-models | 80.9
residual-shuffle-exchange-networks-for-fast | 54.34
glm-130b-an-open-bilingual-pre-trained-model | 80.2
palm-scaling-language-modeling-with-pathways-1 | 89.7
palm-2-technical-report-1 | 80.7
Model 24 | 82.33
language-models-are-few-shot-learners | 70.3
pythia-a-suite-for-analyzing-large-language- | (not listed)
stay-on-topic-with-classifier-free-guidance | 82.2
stay-on-topic-with-classifier-free-guidance | 84.0
language-models-are-few-shot-learners | 76.2
Model 30 | 69.7
mamba-linear-time-sequence-modeling-with | 69.2
massive-language-models-can-be-accurately | 75.59
pythia-a-suite-for-analyzing-large-language | 70.46
palm-scaling-language-modeling-with-pathways-1 | 81.8
all-nlp-tasks-are-generation-tasks-a-general | 67.18
training-compute-optimal-large-language | 77.7
massive-language-models-can-be-accurately | 78.77