Dialogue Evaluation on USR-PersonaChat
Metrics
Pearson Correlation
Spearman Correlation
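As a rough illustration of the two metrics above, the sketch below computes Pearson and Spearman correlation between two score lists in plain Python. On a benchmark like this one, the two lists would presumably be an automatic metric's scores and human quality ratings for the same responses. That pairing, the function names, and the sample numbers are assumptions made for illustration and are not taken from the benchmark itself.

```python
import math


def pearson(x, y):
    """Pearson correlation: covariance scaled by both standard deviations."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("inputs must have equal length of at least 2")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant input")
    return cov / math.sqrt(var_x * var_y)


def average_ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to cover the whole run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared_rank
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman correlation: Pearson correlation of the rank vectors."""
    return pearson(average_ranks(x), average_ranks(y))


# Hypothetical data, for illustration only (not benchmark values).
human_scores = [1.0, 2.0, 3.0, 4.0, 5.0]
metric_scores = [0.1, 0.4, 0.35, 0.8, 0.9]

print("Pearson: ", round(pearson(human_scores, metric_scores), 4))
print("Spearman:", round(spearman(human_scores, metric_scores), 4))
```

Pearson measures how linear the relationship is, while Spearman depends only on rank order, so the two can differ noticeably for the same model.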
Results
Performance of various models on this benchmark
Comparison Table
Model Name | Pearson Correlation | Spearman Correlation |
---|---|---|
proxy-indicators-for-the-quality-of-open | 0.5290 | 0.5382 |
usr-an-unsupervised-and-reference-free | 0.0788 | 0.0795 |
usr-an-unsupervised-and-reference-free | -0.0454 | -0.0495 |
usr-an-unsupervised-and-reference-free | 0.6087 | 0.4814 |
usr-an-unsupervised-and-reference-free | 0.4115 | 0.4693 |