HyperAI

Dialogue Evaluation On Usr Topicalchat

Metriken

Pearson Correlation
Spearman Correlation

Ergebnisse

Leistungsergebnisse verschiedener Modelle zu diesem Benchmark

Vergleichstabelle
ModellnamePearson CorrelationSpearman Correlation
usr-an-unsupervised-and-reference-free0.42200.4192
proxy-indicators-for-the-quality-of-open0.49740.4877
mdd-eval-self-training-on-augmented-data-for0.45750.5109
usr-an-unsupervised-and-reference-free0.40680.3245
usr-an-unsupervised-and-reference-free0.32210.1419
usr-an-unsupervised-and-reference-free0.33450.3086