HyperAI

Image Captioning on nocaps-XD out-of-domain

Metrics

B1
B2
B3
B4
CIDEr
METEOR
ROUGE-L
SPICE

Results

Performance results of various models on this benchmark

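| Model name | B1 | B2 | B3 | B4 | CIDEr | METEOR | ROUGE-L | SPICE | Paper Title | Repository |
|---|---|---|---|---|---|---|---|---|---|---|
| UpDown | 66.54 | 44.28 | 24.23 | 10.17 | 30.09 | 18.29 | 44.84 | 8.08 | - | - |
| icp2ssi1_coco_si_0.02_5_test | 75.59 | 56.71 | 35.63 | 17.72 | 85.28 | 23.77 | 51.92 | 11.28 | - | - |
| UpDown + ELMo + CBS | 71.57 | 48.58 | 25.77 | 9.68 | 66.67 | 20.88 | 47.13 | 9.74 | - | - |
| GIT2 | 86.28 | 71.15 | 52.36 | 30.15 | 122.27 | 30.15 | 60.91 | 15.62 | GIT: A Generative Image-to-text Transformer for Vision and Language | |
| VLAF2 | 79.59 | 61.04 | 40.09 | 19.61 | 90.34 | 26.14 | 54.86 | 13.11 | - | - |
| Human | 74.84 | 53.9 | 33.51 | 16.6 | 91.62 | 26.83 | 51.5 | 14.21 | - | - |
| Microsoft Cognitive Services team | 79.44 | 61.15 | 41.03 | 21.79 | 95.5 | 26.56 | 55.49 | 12.66 | VIVO: Visual Vocabulary Pre-Training for Novel Object Captioning | - |
| Neural Baby Talk | 64.45 | 42.8 | 21.48 | 7.92 | 48.73 | 18.31 | 44.11 | 8.2 | - | - |
| test_cbs2 | 74.5 | 53.63 | 30.91 | 13.41 | 77.94 | 23.47 | 49.66 | 11.07 | - | - |
| Neural Baby Talk + CBS | 65.98 | 43.2 | 21.16 | 7.5 | 58.48 | 19.04 | 44.47 | 8.77 | - | - |
| GIT | 85.99 | 71.28 | 52.66 | 30.04 | 122.04 | 30.45 | 60.96 | 15.7 | GIT: A Generative Image-to-text Transformer for Vision and Language | |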