Coreference Resolution on CoNLL12
Metrics
Average F1
B3
CEAFϕ4
MUC
Results
Performance of various models on this benchmark
Model name | Average F1 | B3 | CEAFϕ4 | MUC | Paper Title | Repository |
---|---|---|---|---|---|---|
DeepStruct multi-task | 60.6 | 57.7 | 60.2 | 63.9 | DeepStruct: Pretraining of Language Models for Structure Prediction | - |
DeepStruct multi-task w/ finetune | 73.1 | 71.3 | 73.1 | 74.9 | DeepStruct: Pretraining of Language Models for Structure Prediction | - |
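
The Average F1 column is consistent with the usual CoNLL convention of taking the unweighted mean of the B3, CEAFϕ4, and MUC F1 scores. A minimal sketch that recomputes it from the rows above, assuming that convention; the `scores` dictionary and the `average_f1` helper are illustrative names, not part of any published tool:

```python
# F1 scores copied from the results table, in the order (B3, CEAFphi4, MUC).
scores = {
    "DeepStruct multi-task": (57.7, 60.2, 63.9),
    "DeepStruct multi-task w/ finetune": (71.3, 73.1, 74.9),
}


def average_f1(b3, ceaf_phi4, muc):
    """Unweighted mean of the three F1 scores, rounded to one decimal.

    Assumption: Average F1 is the plain arithmetic mean of the three metrics.
    """
    return round((b3 + ceaf_phi4 + muc) / 3, 1)


# Recompute the Average F1 column for every model in the table.
averages = {name: average_f1(*vals) for name, vals in scores.items()}

for name, avg in averages.items():
    print(f"{name}: {avg}")
```

Both recomputed values match the Average F1 column as reported (60.6 and 73.1).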