HyperAIHyperAI

Long Video Retrieval Background Removed On

Metriken

Cap. Avg. R@1
Cap. Avg. R@10
Cap. Avg. R@5
DTW R@1
DTW R@10
DTW R@5
OTAM R@1
OTAM R@10
OTAM R@5

Ergebnisse

Leistungsergebnisse verschiedener Modelle zu diesem Benchmark

Modellname
Cap. Avg. R@1
Cap. Avg. R@10
Cap. Avg. R@5
DTW R@1
DTW R@10
DTW R@5
OTAM R@1
OTAM R@10
OTAM R@5
Paper TitleRepository
Norton75.597.795.088.799.598.888.999.598.4Multi-granularity Correspondence Learning from Long-term Noisy Videos-
MCN53.481.475.0------Multimodal Clustering Networks for Self-supervised Learning from Unlabeled Videos-
VideoCLIP74.597.994.556.089.996.352.889.295.0VideoCLIP: Contrastive Pre-training for Zero-shot Video-Text Understanding-
MIL-NCE43.179.168.6------End-to-End Learning of Visual Representations from Uncurated Instructional Videos-
TempCLR74.597.094.683.599.397.284.999.597.9TempCLR: Temporal Alignment Representation with Contrastive Learning-
Text-Video Embedding46.683.774.3------HowTo100M: Learning a Text-Video Embedding by Watching Hundred Million Narrated Video Clips-
0 of 6 row(s) selected.