Text Summarization
Text summarization is a task in natural language processing that aims to compress long documents into shorter, more concise versions while retaining the core information and meaning of the original text. Its goal is to produce summaries that accurately reflect the original content, enabling users to quickly grasp key information. This task encompasses both extractive and abstractive methods; the former identifies and extracts important sentences or phrases, while the latter generates new text based on the content of the original document. Text summarization has significant application value in areas such as news reporting, scientific literature, and business reports.
ACI-Bench
CriSPO 3-shot
AMI
Arxiv HEP-TH citation graph
arXiv
BigBird-Pegasus
arXiv Summarization Dataset
PRIMER
BBC XSum
MatchSum
BigPatent
LongT5
BillSum
Longformer Encoder Decoder
BookSum
Echoes-Extractive-Abstractive
CL-SciSumm
CNN / Daily Mail (Anonymized)
CORD-19
DialogSum
InstructDS
DUC 2004 Task 1
Transformer+WDrop
EurekaAlert
Gazeta
Finetuned mBART
GigaWord
BART-RXF
GigaWord-10k
ERNIE-GENLARGE (large-scale text corpora)
GovReport
FactorSum
How2
Klexikon
Luhn's algorithm (25 sentences)
LCSTS
LSTM-seq2seq
MediaSum
SRformer-BART
MeetingBank
MentSum
MeQSum
BiomedGPT
MTEB
OrangeSum
mBARThez (OrangeSum abstract)
Pubmed
QMSum
BART-LS
Reddit TIFU
S2ORC
GenCompareSum
SAMSum
Webis-Snippet-20 Corpus
Anchor-context + Query biased
WikiHow
BertSum
X-Sum
Selfmem
XSum
SRformer-BART