HyperAI (超神经)

Video Retrieval on VATEX

Evaluation Metrics

text-to-video R@1
text-to-video R@10
text-to-video R@5
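
R@K (Recall at K) is conventionally the percentage of text queries whose correct video appears among the top K retrieved results. The following is a minimal sketch of that computation, not the evaluation code of any paper listed here. It assumes a similarity matrix in which `sim[i][j]` scores text query `i` against video `j`, and that the correct video for query `i` is video `i`; the function name `recall_at_k` and the toy numbers are hypothetical. A dataset with several captions per video would need an explicit query-to-video mapping instead.

```python
def recall_at_k(sim, k):
    """Percentage of queries whose ground-truth video ranks in the top k.

    Assumption (illustrative only): sim[i][j] is the similarity of text
    query i to video j, and the ground-truth video for query i is video i.
    """
    hits = 0
    for i, row in enumerate(sim):
        # Zero-based rank of the correct video: how many videos score strictly higher.
        rank = sum(1 for score in row if score > row[i])
        if rank < k:
            hits += 1
    return 100.0 * hits / len(sim)


# Toy 3-query, 3-video similarity matrix (made-up numbers).
sim = [
    [0.9, 0.1, 0.3],  # correct video ranked 1st
    [0.8, 0.5, 0.2],  # correct video ranked 2nd
    [0.7, 0.6, 0.1],  # correct video ranked 3rd
]

for k in (1, 2, 3):
    print(f"text-to-video R@{k}: {recall_at_k(sim, k):.1f}")
```

On this toy matrix one of three queries is a hit at K=1 and all three are hits at K=3, so recall rises with K, which is the same pattern the R@1, R@5 and R@10 columns show for each model.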

Results

Performance of each model on this benchmark

| Model | text-to-video R@1 | text-to-video R@10 | text-to-video R@5 | Paper Title | Repository |
| --- | --- | --- | --- | --- | --- |
| VAST | 83.0 | 99.2 | 98.2 | VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset | |
| QB-Norm+CLIP2Video | 58.8 | 93.8 | - | Cross Modal Retrieval with Querybank Normalisation | |
| CLIP2Video | 57.3 | 90 | - | CLIP2Video: Mastering Video-Text Retrieval via Image CLIP | |
| Side4Video | 68.8 | 97.0 | 93.5 | Side4Video: Spatial-Temporal Side Network for Memory-Efficient Image-to-Video Transfer Learning | |
| VALOR | 78.5 | 98.7 | 97.1 | VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset | |
| Cap4Video | 66.6 | 97.0 | 93.1 | Cap4Video: What Can Auxiliary Captions Do for Text-Video Retrieval? | |
| InternVideo2-6B | 75.5 | - | - | InternVideo2: Scaling Foundation Models for Multimodal Video Understanding | |
| GRAM | 87.7 | 100 | - | Gramian Multimodal Representation Learning and Alignment | |
| TS2-Net | 59.1 | 95.2 | - | TS2-Net: Token Shift and Selection Transformer for Text-Video Retrieval | |
| LAFF | 59.1 | 91.7 | - | Lightweight Attentional Feature Fusion: A New Baseline for Text-to-Video Retrieval | |
| Unmasked Teacher | 72 | 97.8 | 95.1 | Unmasked Teacher: Towards Training-Efficient Video Foundation Models | |
| InternVideo | 71.1 | - | - | InternVideo: General Video Foundation Models via Generative and Discriminative Learning | |
| TeachCLIP | 63.6 | 96.1 | 91.9 | Holistic Features are almost Sufficient for Text-to-Video Retrieval | - |