HyperAI
Multi Task Language Understanding
Benchmark List
All benchmarks related to this task
bbh-alg
Best model: code-davinci-002 175B (CoT)
bbh-nlp
Best model: Flan-PaLM 540B (3-shot, fine-tuned, CoT + SC)
mgsm
mmlu
Best model: ds-r1(671b)
mmlu-5-shot
Best model: Sakalti/ultiima-78B