Text Summarization On Reddit Tifu
평가 지표
ROUGE-1
ROUGE-2
ROUGE-L
평가 결과
이 벤치마크에서 각 모델의 성능 결과
모델 이름 | ROUGE-1 | ROUGE-2 | ROUGE-L | Paper Title | Repository |
---|---|---|---|---|---|
MatchSum | 25.09 | 6.17 | 20.13 | Extractive Summarization as Text Matching | |
MUPPET BART Large | 30.3 | 11.25 | 24.92 | Muppet: Massive Multi-task Representations with Pre-Finetuning | |
PEGASUS + SummaReranker | 29.83 | 9.5 | 23.47 | SummaReranker: A Multi-Task Mixture-of-Experts Re-ranking Framework for Abstractive Summarization | |
PEGASUS 2B + SLiC | 32.03 | 11.13 | 25.51 | Calibrating Sequence likelihood Improves Conditional Language Generation | - |
BART+R3F | 30.31 | 10.98 | 24.74 | Better Fine-Tuning by Reducing Representational Collapse |
0 of 5 row(s) selected.