HyperAI
Natural Language Inference on CommitmentBank
Metrics
Accuracy
Results
Performance results of various models on this benchmark
| Model Name | Accuracy | Paper Title |
| --- | --- | --- |
| PaLM 540B (finetuned) | 100 | PaLM: Scaling Language Modeling with Pathways |
| Vega v2 6B (KD-based prompt transfer) | 99.2 | Toward Efficient Language Model Pretraining and Downstream Adaptation via Self-Evolution: A Case Study on SuperGLUE |
| ST-MoE-L 4.1B (fine-tuned) | 98.2 | ST-MoE: Designing Stable and Transferable Sparse Expert Models |
| ST-MoE-32B 269B (fine-tuned) | 98 | ST-MoE: Designing Stable and Transferable Sparse Expert Models |
| Turing NLR v5 XXL 5.4B (fine-tuned) | 97.6 | Toward Efficient Language Model Pretraining and Downstream Adaptation via Self-Evolution: A Case Study on SuperGLUE |
| DeBERTa-1.5B | 97.2 | DeBERTa: Decoding-enhanced BERT with Disentangled Attention |
| T5-XXL 11B (fine-tuned) | 96.8 | Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer |
| T5-Large 770M (fine-tuned) | 94.4 | Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer |
| T5-Base 220M (fine-tuned) | 94 | Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer |
| PaLM 2-L (one-shot) | 87.5 | PaLM 2 Technical Report |
| PaLM 2-S (one-shot) | 82.1 | PaLM 2 Technical Report |
| PaLM 2-M (one-shot) | 80.4 | PaLM 2 Technical Report |
| GPT-3 175B (Few-Shot) | 75.6 | Language Models are Few-Shot Learners |
| N-Grammer 343M | 67.9 | N-Grammer: Augmenting Transformers with latent n-grams |
| AlexaTM 20B | 67.9 | AlexaTM 20B: Few-Shot Learning Using a Large-Scale Multilingual Seq2Seq Model |
| Bloomberg GPT (one-shot) | 53.57 | BloombergGPT: A Large Language Model for Finance |
| GPT-NeoX (one-shot) | 48.21 | BloombergGPT: A Large Language Model for Finance |
| BLOOM 176B (one-shot) | 48.21 | BloombergGPT: A Large Language Model for Finance |
| OPT 66B (one-shot) | 44.64 | BloombergGPT: A Large Language Model for Finance |
| GPT-3 175B (few-shot, k=32) | - | Language Models are Few-Shot Learners |