HyperAI

Video Retrieval On Condensed Movies

Metrics

text-to-video R@1
text-to-video R@10
text-to-video R@5

Results

Performance results of various models on this benchmark

Comparison Table
Model Nametext-to-video R@1text-to-video R@10text-to-video R@5
vindlu-a-recipe-for-effective-video-and18.444.336.4
long-form-video-language-pre-training-with13.641.832.5
testa-temporal-spatial-token-aggregation-for24.955.146.5