HyperAI

Image Captioning On Nocaps Val Out Domain

Metriken

CIDEr
SPICE

Ergebnisse

Leistungsergebnisse verschiedener Modelle zu diesem Benchmark

Vergleichstabelle
ModellnameCIDErSPICE
conceptual-12m-pushing-web-scale-image-text94.511.9
blip-bootstrapping-language-image-pre111.514.2
blip-bootstrapping-language-image-pre115.3 14.4
blip-2-bootstrapping-language-image-pre124.815.1
omnivl-one-foundation-model-for-image106.314.2
blip-2-bootstrapping-language-image-pre124.414.8
scaling-up-vision-language-pre-training-for111.3 14.0
blip-2-bootstrapping-language-image-pre123.415.1
simvlm-simple-visual-language-model115.2-
vinvl-making-visual-representations-matter-in 88.3 12.1