Image Captioning On Nocaps Val Out Domain
평가 지표
CIDEr
SPICE
평가 결과
이 벤치마크에서 각 모델의 성능 결과
비교 표
모델 이름 | CIDEr | SPICE |
---|---|---|
conceptual-12m-pushing-web-scale-image-text | 94.5 | 11.9 |
blip-bootstrapping-language-image-pre | 111.5 | 14.2 |
blip-bootstrapping-language-image-pre | 115.3 | 14.4 |
blip-2-bootstrapping-language-image-pre | 124.8 | 15.1 |
omnivl-one-foundation-model-for-image | 106.3 | 14.2 |
blip-2-bootstrapping-language-image-pre | 124.4 | 14.8 |
scaling-up-vision-language-pre-training-for | 111.3 | 14.0 |
blip-2-bootstrapping-language-image-pre | 123.4 | 15.1 |
simvlm-simple-visual-language-model | 115.2 | - |
vinvl-making-visual-representations-matter-in | 88.3 | 12.1 |