HyperAI

Image Retrieval On Photochat

Metriken

R1
R@10
R@5
Sum(R@1,5,10)

Ergebnisse

Leistungsergebnisse verschiedener Modelle zu diesem Benchmark

Vergleichstabelle
ModellnameR1R@10R@5Sum(R@1,5,10)
vlmo-unified-vision-language-pre-training11.539.430.083.2
vilt-vision-and-language-transformer-without11.525.633.871.0
pace-unified-multi-modal-dialogue-pre15.249.636.7101.5
stacked-cross-attention-for-image-text10.437.127.074.5
photochat-a-human-human-dialogue-dataset-with9.035.726.471.1