HyperAI

Image Captioning On Nocaps Xd Out Of Domain

Métriques

B1
B2
B3
B4
CIDEr
METEOR
ROUGE-L
SPICE

Résultats

Résultats de performance de divers modèles sur ce benchmark

Tableau comparatif
Nom du modèleB1B2B3B4CIDErMETEORROUGE-LSPICE
Modèle 166.5444.2824.2310.1730.0918.2944.848.08
Modèle 275.5956.7135.6317.7285.2823.7751.9211.28
Modèle 371.5748.5825.779.6866.6720.8847.139.74
git-a-generative-image-to-text-transformer86.2871.1552.3630.15122.2730.1560.9115.62
Modèle 579.5961.0440.0919.6190.3426.1454.8613.11
Modèle 674.8453.933.5116.691.6226.8351.514.21
vivo-surpassing-human-performance-in-novel79.4461.1541.0321.7995.526.5655.4912.66
Modèle 864.4542.821.487.9248.7318.3144.118.2
Modèle 974.553.6330.9113.4177.9423.4749.6611.07
Modèle 1065.9843.221.167.558.4819.0444.478.77
git-a-generative-image-to-text-transformer85.9971.2852.6630.04122.0430.4560.9615.7