
Video Retrieval on FIVR-200K

Metrics

mAP (CSVR)
mAP (DSVR)
mAP (ISVR)

Results

Performance results of various models on this benchmark

| Model name | mAP (CSVR) | mAP (DSVR) | mAP (ISVR) | Paper Title | Repository |
|---|---|---|---|---|---|
| ViSiLf | 0.797 | 0.843 | 0.660 | ViSiL: Fine-grained Spatio-Temporal Video Similarity Learning | |
| TCAc | 0.553 | 0.570 | 0.473 | Temporal Context Aggregation for Video Retrieval with Contrastive Learning | |
| DnS (S^f_B) | 0.863 | 0.909 | 0.729 | DnS: Distill-and-Select for Efficient and Accurate Video Indexing and Retrieval | |
| ViSiLsym | 0.792 | 0.833 | 0.654 | ViSiL: Fine-grained Spatio-Temporal Video Similarity Learning | |
| VRAG (CS) | 0.678 | 0.723 | 0.554 | VRAG: Region Attention Graphs for Content-Based Video Retrieval | - |
| ViSiLv (pt) | 0.854 | 0.899 | 0.723 | ViSiL: Fine-grained Spatio-Temporal Video Similarity Learning | |
| DnS (S^c) | 0.558 | 0.574 | 0.476 | DnS: Distill-and-Select for Efficient and Accurate Video Indexing and Retrieval | |
| S2VS | 0.879 | 0.927 | 0.746 | Self-Supervised Video Similarity Learning | |
| Jo et al. (SCFV+TNIP) | 0.833 | 0.896 | 0.674 | Exploring the Temporal Cues to Enhance Video Retrieval on Standardized CDVA | - |
| TCAf | 0.830 | 0.877 | 0.703 | Temporal Context Aggregation for Video Retrieval with Contrastive Learning | |
| TCAsym | 0.698 | 0.728 | 0.592 | Temporal Context Aggregation for Video Retrieval with Contrastive Learning | |
| ViSiLv (tf) | 0.841 | 0.892 | 0.702 | ViSiL: Fine-grained Spatio-Temporal Video Similarity Learning | |
| VVS | 0.689 | 0.711 | 0.590 | VVS: Video-to-Video Retrieval with Irrelevant Frame Suppression | |
| S2VS | 0.878 | 0.925 | 0.739 | Self-Supervised Video Similarity Learning | |
| DnS (S^f_A) | 0.875 | 0.921 | 0.741 | DnS: Distill-and-Select for Efficient and Accurate Video Indexing and Retrieval | |
| VRAG (video) | 0.470 | 0.484 | 0.399 | VRAG: Region Attention Graphs for Content-Based Video Retrieval | - |
| LAMV | 0.466 | 0.496 | 0.371 | LAMV: Learning to Align and Match Videos With Kernelized Temporal Layers | - |