HyperAI

Zero Shot Video Retrieval On Vatex

Metriken

text-to-video R@1
text-to-video R@10
video-to-text R@1
video-to-text R@10

Ergebnisse

Leistungsergebnisse verschiedener Modelle zu diesem Benchmark

Vergleichstabelle
Modellnametext-to-video R@1text-to-video R@10video-to-text R@1video-to-text R@10
gramian-multimodal-representation-learning83.999.582.799
video-text-modeling-with-zero-shot-transfer53.290.173.697.2
internvideo2-scaling-video-foundation-models71.597.185.399.3
internvideo-general-video-foundation-models49.5-69.5-
internvideo2-scaling-video-foundation-models70.496.985.499.1