Question Answering on MuLD HotpotQA

BLEU-1

BLEU-4

METEOR

Rouge-L

Evaluation Results

Performance of each model on this benchmark

Model	BLEU-1	BLEU-4	METEOR	Rouge-L	Paper Title	Repository
Longformer	30.38	16.76	4.98	30.49	MuLD: The Multitask Long Document Benchmark
T5	28.11	13.63	4.46	27.61	MuLD: The Multitask Long Document Benchmark
