HyperAI

Natural Language Inference On Rte

Metrics

Accuracy

Results

Performance results of various models on this benchmark

Comparison Table
Model NameAccuracy
not-all-layers-are-equally-as-important-every53.7
distilbert-a-distilled-version-of-bert62.9%
guess-the-instruction-making-language-models71.05
ernie-enhanced-language-representation-with68.8%
smart-robust-and-efficient-fine-tuning-for92.5%
albert-a-lite-bert-for-self-supervised89.2%
bloomberggpt-a-large-language-model-for53.8%
palm-scaling-language-modeling-with-pathways-178.7%
knowledge-in-context-towards-knowledgeable74.00
data2vec-a-general-framework-for-self-169.9%
lamini-lm-a-diverse-herd-of-distilled-models65%
palm-scaling-language-modeling-with-pathways-172.9%
q-bert-hessian-based-ultra-low-precision84.7
palm-2-technical-report-179.3%
exploring-the-benefits-of-training-expert64.01
smart-robust-and-efficient-fine-tuning-for71.2%
informer-transformer-likes-informed-attention73.7%
palm-2-technical-report-178.7%
xlnet-generalized-autoregressive-pretraining85.9%
big-bird-transformers-for-longer-sequences75.0%
hungry-hungry-hippos-towards-language53.1%
unifying-language-learning-paradigms60.7%
opt-iml-scaling-language-model-instruction60.3%
structbert-incorporating-language-structures88.7%
not-all-layers-are-equally-as-important-every55.4
ask-me-anything-a-simple-strategy-for75.1%
lamini-lm-a-diverse-herd-of-distilled-models52.3%
bloomberggpt-a-large-language-model-for69.3%
q8bert-quantized-8bit-bert84.8
entailment-as-few-shot-learner87.2%
hungry-hungry-hippos-towards-language58.1%
exploring-the-limits-of-transfer-learning87.2%
designing-effective-sparse-expert-models93.5%
smart-robust-and-efficient-fine-tuning-for92.0%
llm-int8-8-bit-matrix-multiplication-for85.4%
19091035162.9%
the-cot-collection-improving-zero-shot-and80.8%
bert-pre-training-of-deep-bidirectional70.1%
exploring-the-limits-of-transfer-learning80.1%
lamini-lm-a-diverse-herd-of-distilled-models57%
n-grammer-augmenting-transformers-with-latent-159.2%
ask-me-anything-a-simple-strategy-for58.8%
unifying-language-learning-paradigms92.1%
hungry-hungry-hippos-towards-language58.1%
alexatm-20b-few-shot-learning-using-a-large68.6%
opt-iml-scaling-language-model-instruction54.2%
palm-scaling-language-modeling-with-pathways-179.6%
lamini-lm-a-diverse-herd-of-distilled-models87.4%
lamini-lm-a-diverse-herd-of-distilled-models67.9%
palm-scaling-language-modeling-with-pathways-195.7%
palm-2-technical-report-181.9%
smart-robust-and-efficient-fine-tuning-for71.2%
designing-effective-sparse-expert-models92.1%
fnet-mixing-tokens-with-fourier-transforms69%
debertav3-improving-deberta-using-electra92.7%
opt-iml-scaling-language-model-instruction84.8%
Model 5783.6%
entailment-as-few-shot-learner90.5%
squeezebert-what-can-computer-vision-teach73.2%
toward-efficient-language-model-pretraining96%
finetuned-language-models-are-zero-shot84.5%
toward-efficient-language-model-pretraining94.1%
hungry-hungry-hippos-towards-language52.3%
opt-iml-scaling-language-model-instruction66.8%
exploring-the-limits-of-transfer-learning91.1%
19091035166%
bloomberggpt-a-large-language-model-for54.9%
muppet-massive-multi-task-representations92.8%
ask-me-anything-a-simple-strategy-for61.7%
exploring-the-limits-of-transfer-learning92.5%
exploring-the-limits-of-transfer-learning69.9%
language-models-are-few-shot-learners69%
ernie-20-a-continual-pre-training-framework74.8%
sensebert-driving-some-sense-into-bert67.5%
finetuned-language-models-are-zero-shot84.1%
ernie-20-a-continual-pre-training-framework80.2%
roberta-a-robustly-optimized-bert-pretraining88.2%
not-all-layers-are-equally-as-important-every63
opt-iml-scaling-language-model-instruction58.1%
finetuned-language-models-are-zero-shot91.7%
spanbert-improving-pre-training-by79.0%
deberta-decoding-enhanced-bert-with93.2%
hungry-hungry-hippos-towards-language59.2%
roberta-a-robustly-optimized-bert-pretraining88.2%
how-to-train-bert-with-an-academic-budget57.7%
not-all-layers-are-equally-as-important-every54.7
clear-contrastive-learning-for-sentence79.8%
opt-iml-scaling-language-model-instruction83.8%
bloomberggpt-a-large-language-model-for57.4%
a-statistical-framework-for-low-bitwidth86.8