HyperAI

Text Generation

Benchmark List

All benchmarks related to this task

chinese-poems
Best model: RankGAN

cmu-se
Best model: STWGAN-GP

cnn-daily-mail-1
Best model: PALM

coco-captions
Best model: LeakGAN

commongen
Best model: UniLM

emnlp2017-wmt
Best model: LeakGAN

harmfulqa
Best model: GPT-4

one-billion-word
Best model: WGANGP + DGflow

redial
Best model: UniCRS

rocstories
Best model: Beam search + A*esque (sample)

yahoo-questions
Best model: Aggressive VAE

adgen

ai2-reasoning-challenge-25-shot

ai2-reasoning-challenge-tr

alpaca-eval-pt

alpacaeval

arc-challenge-pt

assin2-rte

assin2-sts

bbh-3-shot

bluex-no-images

calame-pt

censorship-0-shot

creativity-0-shot

crimestats

csl

czech-restaurant-information

dailydialog

dart

drop-3-shot

enem-challenge-no-images

faquad-nli

gpqa-0-shot

gsm8k-5-shot

gsm8k-tr

hatebr-binary

hellaswag-10-shot

hellaswag-pt

hellaswag-tr

humanness-0-shot

ifeval-0-shot

internet

lambada-pt

lcsts

ldc2016e25

math-lvl-5-4-shot

mmlu-5-shot

mmlu-pro-5-shot

mmlu-tr

mt-bench

mt-bench-jp

musr-0-shot

oab-exams

open-mindedness-0-shot

openwebtext

polcontro

pt-hate-speech-binary

sciq

stories-jokes

talking-0-shot

truthfulqa

truthfulqa-0-shot

truthfulqa-pt

truthfulqa-tr

tweetsentbr

unruly

w-10

wikitext-103

winogrande-5-shot

winogrande-tr

world-knowledge-0-shot
