HyperAI

Coherence Evaluation On Gcdc Rst F1

Metrics

Average F1

Results

Performance results of various models on this benchmark

Comparison Table
Model NameAverage F1
discourse-coherence-in-the-wild-a-dataset46.65
neural-rst-based-evaluation-of-discourse46.98
neural-rst-based-evaluation-of-discourse44.30