HyperAI초신경

Dialogue Evaluation on USR-TopicalChat

Evaluation Metrics

Pearson Correlation
Spearman Correlation
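
As a rough illustration of the two metrics above, the sketch below computes Pearson and Spearman correlation between two lists of scores using only the Python standard library. It assumes, as is usual for dialogue evaluation, that an automatic metric's scores are correlated against human quality ratings; the page does not state the exact procedure, and the names `pearson`, `spearman`, `average_ranks`, `human`, and `metric` are hypothetical, as are the sample values.

```python
import math


def pearson(x, y):
    """Pearson correlation: covariance divided by the product of standard deviations."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("inputs must have equal length of at least 2")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / math.sqrt(var_x * var_y)


def average_ranks(values):
    """Return 1-based ranks, giving tied values the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to cover the whole run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg_rank
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman correlation: Pearson correlation of the rank-transformed inputs."""
    return pearson(average_ranks(x), average_ranks(y))


# Hypothetical data: human ratings and an automatic metric's scores
# for the same five responses.
human = [1, 2, 3, 4, 5]
metric = [0.1, 0.4, 0.35, 0.8, 0.9]

print(f"Pearson:  {pearson(human, metric):.4f}")
print(f"Spearman: {spearman(human, metric):.4f}")
```

Pearson measures linear agreement between the raw scores, while Spearman depends only on their ordering, so a metric can rank responses well and still score lower on Pearson if its scale is nonlinear.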

Evaluation Results

Performance results of each model on this benchmark

Comparison Table
Model Name                                    Pearson Correlation  Spearman Correlation
usr-an-unsupervised-and-reference-free        0.4220               0.4192
proxy-indicators-for-the-quality-of-open      0.4974               0.4877
mdd-eval-self-training-on-augmented-data-for  0.4575               0.5109
usr-an-unsupervised-and-reference-free        0.4068               0.3245
usr-an-unsupervised-and-reference-free        0.3221               0.1419
usr-an-unsupervised-and-reference-free        0.3345               0.3086