Dialogue Evaluation on USR-PersonaChat
Metrics
Pearson Correlation
Spearman Correlation
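As a rough illustration of how these two metrics are usually computed for dialogue evaluation, the sketch below correlates an automatic metric's scores with human quality ratings for the same responses. The score lists and function names are hypothetical and are not taken from the benchmark or from any of the listed papers. Spearman correlation is implemented here as Pearson correlation over average ranks, which is one common way to handle ties.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation between two equal-length numeric sequences."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / sqrt(var_x * var_y)


def average_ranks(values):
    """1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared_rank
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman correlation: Pearson correlation of the rank vectors."""
    return pearson(average_ranks(x), average_ranks(y))


if __name__ == "__main__":
    # Hypothetical data: one metric score and one human rating per response.
    metric_scores = [0.10, 0.40, 0.35, 0.80, 0.90]
    human_ratings = [1, 2, 3, 4, 5]
    print("Pearson: ", round(pearson(metric_scores, human_ratings), 4))
    print("Spearman:", round(spearman(metric_scores, human_ratings), 4))
```

Pearson correlation measures linear agreement with the human ratings, while Spearman correlation only measures agreement in ordering, so a metric can score well on one and noticeably worse on the other.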
Results
Performance of the reported models on this benchmark:
Model Name | Pearson Correlation | Spearman Correlation | Paper Title | Repository |
---|---|---|---|---|
Lin-Reg (all) | 0.5290 | 0.5382 | Proxy Indicators for the Quality of Open-domain Dialogues | - |
USR - MLM | 0.0788 | 0.0795 | USR: An Unsupervised and Reference Free Evaluation Metric for Dialog Generation | |
USR - DR (x = f) | -0.0454 | -0.0495 | USR: An Unsupervised and Reference Free Evaluation Metric for Dialog Generation | |
USR - DR (x = c) | 0.6087 | 0.4814 | USR: An Unsupervised and Reference Free Evaluation Metric for Dialog Generation | |
USR | 0.4115 | 0.4693 | USR: An Unsupervised and Reference Free Evaluation Metric for Dialog Generation | |