Text Generation
Benchmark List
All benchmarks related to this task
chinese-poems
Best model: RankGAN
cmu-se
Best model: STWGAN-GP
cnn-daily-mail-1
Best model: PALM
coco-captions
Best model: LeakGAN
commongen
Best model: UniLM
emnlp2017-wmt
Best model: LeakGAN
harmfulqa
Best model: GPT-4
one-billion-word
Best model: WGANGP + DGflow
redial
Best model: UniCRS
rocstories
Best model: Beam search + A*esque (sample)
yahoo-questions
Best model: Aggressive VAE
adgen
ai2-reasoning-challenge-25-shot
ai2-reasoning-challenge-tr
alpaca-eval-pt
alpacaeval
arc-challenge-pt
assin2-rte
assin2-sts
bbh-3-shot
bluex-no-images
calame-pt
censorship-0-shot
creativity-0-shot
crimestats
csl
czech-restaurant-information
dailydialog
dart
drop-3-shot
enem-challenge-no-images
faquad-nli
gpqa-0-shot
gsm8k-5-shot
gsm8k-tr
hatebr-binary
hellaswag-10-shot
hellaswag-pt
hellaswag-tr
humanness-0-shot
ifeval-0-shot
internet
lambada-pt
lcsts
ldc2016e25
math-lvl-5-4-shot
mmlu-5-shot
mmlu-pro-5-shot
mmlu-tr
mt-bench
mt-bench-jp
musr-0-shot
oab-exams
open-mindedness-0-shot
openwebtext
polcontro
pt-hate-speech-binary
sciq
stories-jokes
talking-0-shot
truthfulqa
truthfulqa-0-shot
truthfulqa-pt
truthfulqa-tr
tweetsentbr
unruly
w-10
wikitext-103
winogrande-5-shot
winogrande-tr
world-knowledge-0-shot