HyperAI

Long Video Retrieval Background Removed On

Métriques

Cap. Avg. R@1
Cap. Avg. R@10
Cap. Avg. R@5
DTW R@1
DTW R@10
DTW R@5
OTAM R@1
OTAM R@10
OTAM R@5

Résultats

Résultats de performance de divers modèles sur ce benchmark

Nom du modèle
Cap. Avg. R@1
Cap. Avg. R@10
Cap. Avg. R@5
DTW R@1
DTW R@10
DTW R@5
OTAM R@1
OTAM R@10
OTAM R@5
Paper TitleRepository
Norton75.597.795.088.799.598.888.999.598.4Multi-granularity Correspondence Learning from Long-term Noisy Videos
MCN53.481.475.0------Multimodal Clustering Networks for Self-supervised Learning from Unlabeled Videos
VideoCLIP74.597.994.556.089.996.352.889.295.0VideoCLIP: Contrastive Pre-training for Zero-shot Video-Text Understanding
MIL-NCE43.179.168.6------End-to-End Learning of Visual Representations from Uncurated Instructional Videos
TempCLR74.597.094.683.599.397.284.999.597.9TempCLR: Temporal Alignment Representation with Contrastive Learning
Text-Video Embedding46.683.774.3------HowTo100M: Learning a Text-Video Embedding by Watching Hundred Million Narrated Video Clips
0 of 6 row(s) selected.