HyperAI超神経

Dialogue Evaluation On Usr Topicalchat

評価指標

Pearson Correlation
Spearman Correlation

評価結果

このベンチマークにおける各モデルのパフォーマンス結果

比較表
モデル名Pearson CorrelationSpearman Correlation
usr-an-unsupervised-and-reference-free0.42200.4192
proxy-indicators-for-the-quality-of-open0.49740.4877
mdd-eval-self-training-on-augmented-data-for0.45750.5109
usr-an-unsupervised-and-reference-free0.40680.3245
usr-an-unsupervised-and-reference-free0.32210.1419
usr-an-unsupervised-and-reference-free0.33450.3086