Coreference Resolution On Conll12

Average F1

CEAFϕ4

MUC

평가 결과

이 벤치마크에서 각 모델의 성능 결과

					Paper Title
DeepStruct multi-task w/ finetune	73.1	71.3	73.1	74.9	DeepStruct: Pretraining of Language Models for Structure Prediction
DeepStruct multi-task	60.6	57.7	60.2	63.9	DeepStruct: Pretraining of Language Models for Structure Prediction

0 of 2 row(s) selected.