HyperAI

Image Retrieval On Flickr30K 1K Test

Metrics

R@1
R@10
R@5
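
For reference, R@K (Recall at K) in text-to-image retrieval is usually read as the percentage of caption queries whose matching image appears among the top K ranked images. The sketch below illustrates that reading on a toy score matrix. It is not taken from this page or from any listed paper: the function name `recall_at_k`, the toy data, and the tie-handling rule are assumptions made for illustration.

```python
def recall_at_k(scores, gt, ks=(1, 5, 10)):
    """Return {k: percentage of queries whose correct image is in the top k}.

    scores[q][i] is the similarity of caption query q to image i
    (higher means more similar); gt[q] is the index of the correct image.
    Both names are hypothetical, chosen for this example.
    """
    hits = {k: 0 for k in ks}
    for row, target in zip(scores, gt):
        # Zero-based rank of the correct image: the number of images
        # scored strictly higher. Ties count in the query's favour
        # (an assumption of this sketch).
        rank = sum(1 for s in row if s > row[target])
        for k in ks:
            if rank < k:
                hits[k] += 1
    n = len(gt)
    return {k: 100.0 * hits[k] / n for k in ks}


# Toy data: 3 caption queries scored against 4 images.
toy_scores = [
    [0.9, 0.1, 0.2, 0.3],  # correct image 0 is ranked first
    [0.8, 0.5, 0.6, 0.1],  # correct image 1 is ranked third
    [0.4, 0.3, 0.2, 0.9],  # correct image 2 is ranked last
]
toy_gt = [0, 1, 2]
result = recall_at_k(toy_scores, toy_gt, ks=(1, 3, 4))
print(result)
```

On this toy input one of three queries is a hit at K=1, two at K=3, and all three at K=4. A real evaluation would run the same computation with K = 1, 5, 10 over the full set of test captions and images.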

Results

Performance results of various models on this benchmark

Comparison Table
Model Name                                     R@1    R@10   R@5
fine-grained-visual-textual-alignment-for      56.5   88.2   81.2
dual-attention-networks-for-multimodal         39.4   79.1   69.2
visualsparta-sparse-transformer-fragment       57.4   88.1   82.0
linking-image-and-text-with-2-way-nets         36.0   -      -
deep-visual-semantic-alignments-for            15.2   50.5   -
plug-and-play-regulators-for-image-text        62.6   91.1   85.8
stacked-cross-attention-for-image-text         44.0   82.6   74.2
flickr30k-entities-collecting-region-to        24.7   66.8   53.4
learning-semantic-concepts-and-order-for       41.1   80.1   70.5
camp-cross-modal-adaptive-message-passing-for  51.5   85.3   77.1
multimodal-convolutional-neural-networks-for   26.2   69.6   56.3
a-deep-local-and-global-scene-graph-matching   57.4   90.2   84.1
learning-deep-structure-preserving-image-text  29.7   72.1   60.1
visual-semantic-reasoning-for-image-text       54.7   88.2   81.8
instance-aware-image-and-sentence-matching     30.2   72.3   -
multi-grained-vision-language-pre-training     86.9   98.7   97.3
similarity-reasoning-and-filtration-for-image  58.5   88.8   83.0
fine-grained-visual-textual-alignment-for      55.7   89.3   83.1