HyperAI

Human Judgment Correlation On Flickr8K Expert

Metrics

Kendall's Tau-c

Results

Performance results of various models on this benchmark

Comparison Table
Model NameKendall's Tau-c
clipscore-a-reference-free-evaluation-metric51.2
factual-a-benchmark-for-faithful-and54.2
mutual-information-divergence-a-unified54.9
clipscore-a-reference-free-evaluation-metric53.0