HyperAI

Image Captioning On Nocaps Val

Metrics

CIDEr
SPICE

Results

Performance results of various models on this benchmark

Comparison Table
Model NameCIDErSPICE
language-models-are-general-purpose58.78.6
prismer-a-vision-language-model-with-an107.914.8
unifying-vision-and-language-tasks-via-text4.4 5.3