Question Answering on MuLD NarrativeQA

Metrics: BLEU-1, BLEU-4, METEOR, Rouge-L

Evaluation Results

Performance of each model on this benchmark

Model	BLEU-1	BLEU-4	METEOR	Rouge-L	Paper Title	Repository
Longformer	19.84	62	4.52	22.09	MuLD: The Multitask Long Document Benchmark
T5	17.67	55	3.36	19.03	MuLD: The Multitask Long Document Benchmark