Human Judgment Correlation On Flickr8K Expert
Metriken
Kendall's Tau-c
Ergebnisse
Leistungsergebnisse verschiedener Modelle zu diesem Benchmark
Modellname | Kendall's Tau-c | Paper Title | Repository |
---|---|---|---|
CLIP-S | 51.2 | CLIPScore: A Reference-free Evaluation Metric for Image Captioning | |
SoftSPICE | 54.2 | FACTUAL: A Benchmark for Faithful and Consistent Textual Scene Graph Parsing | |
MID | 54.9 | Mutual Information Divergence: A Unified Metric for Multimodal Generative Models | |
RefCLIP-S | 53.0 | CLIPScore: A Reference-free Evaluation Metric for Image Captioning |
0 of 4 row(s) selected.