HyperAI

Image Captioning on COCO

Metrics

BLEU-4
CIDEr
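
As a rough illustration of what the BLEU-4 metric measures, the following is a minimal sketch of sentence-level BLEU-4: the geometric mean of clipped 1- to 4-gram precisions, multiplied by a brevity penalty. It makes several assumptions. It uses whitespace tokenization and no smoothing. The function names `ngrams` and `bleu4` are hypothetical. It returns a value in [0, 1], whereas leaderboard scores such as those on this page are typically reported on a 0 to 100 scale and computed at corpus level with a standard evaluation toolkit. It is not the scoring code used for this benchmark.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Count all contiguous n-grams in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def bleu4(candidate, references):
    """Sentence-level BLEU-4 (unsmoothed) for one caption and its references."""
    cand = candidate.split()
    refs = [r.split() for r in references]
    if not cand:
        return 0.0

    log_precisions = []
    for n in range(1, 5):
        cand_counts = ngrams(cand, n)
        # For each n-gram, the largest count seen in any single reference.
        max_ref = Counter()
        for ref in refs:
            for gram, count in ngrams(ref, n).items():
                max_ref[gram] = max(max_ref[gram], count)
        # Clip candidate counts so repeated n-grams are not over-rewarded.
        clipped = sum(min(c, max_ref[g]) for g, c in cand_counts.items())
        total = sum(cand_counts.values())
        if clipped == 0 or total == 0:
            return 0.0  # without smoothing, any zero precision gives BLEU = 0
        log_precisions.append(math.log(clipped / total))

    # Brevity penalty against the reference length closest to the candidate.
    ref_len = min((len(r) for r in refs), key=lambda L: (abs(L - len(cand)), L))
    bp = 1.0 if len(cand) > ref_len else math.exp(1 - ref_len / len(cand))
    return bp * math.exp(sum(log_precisions) / 4)
```

For example, `bleu4("a man rides a horse", ["a man rides a horse on the beach"])` has perfect n-gram precision but is penalized for being shorter than the reference. CIDEr is not sketched here; it weights n-grams by TF-IDF over the reference set and so needs corpus statistics.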

Results

Performance results of various models on this benchmark

Comparison Table
| Model Name | BLEU-4 | CIDEr |
| --- | --- | --- |
| unimo-towards-unified-modal-understanding-and | 39.6 | 127.7 |
| retrieval-augmented-multimodal-language | - | 20.2 |
| analog-bits-generating-discrete-data-using | 34.7 | 115 |
| retrieval-augmented-multimodal-language | - | 103 |
| retrieval-augmented-multimodal-language | - | 38.7 |
| retrieval-augmented-multimodal-language | - | 48 |
| Model 7 | 39.9 | 131.0 |
| retrieval-augmented-multimodal-language | - | 83.9 |
| retrieval-augmented-multimodal-language | - | 55.8 |
| reflective-decoding-network-for-image | - | 125.2 |
| expansionnet-v2-block-static-expansion-in | - | 143.7 |
| retrieval-augmented-multimodal-language | - | 71.9 |
| retrieval-augmented-multimodal-language | - | 89.1 |
| m2-meshed-memory-transformer-for-image | - | 131.2 |
| retrieval-augmented-multimodal-language | - | 85 |
| cutmix-regularization-strategy-to-train | 24.9 | 77.6 |
| lyrics-boosting-fine-grained-language-vision | - | 121.1 |