HyperAI超神经

Video Captioning On Youcook2

评估指标

BLEU-4
CIDEr
METEOR
ROUGE-L

评测结果

各个模型在此基准测试上的表现结果

比较表格
模型名称BLEU-4CIDErMETEORROUGE-L
howtocaption-prompting-llms-to-transform8.8116.415.937.3
vast-a-vision-audio-subtitle-text-omni-118.21.99--
videobert-a-joint-model-for-video-and4.330.5511.9428.80
ma-lmm-memory-augmented-large-multimodal-1.3117.6-
omnivl-one-foundation-model-for-image8.721.1614.8336.09
cosa-concatenated-sample-pretrained-vision10.11.31--
meltr-meta-loss-transformer-for-learning-to17.921.9022.5647.04
univilm-a-unified-video-and-language-pre17.351.8122.3546.52
multimodal-pretraining-for-dense-video12.041.2218.3239.03
text-with-knowledge-graph-augmented11.71.3314.840.2
vlm-task-agnostic-video-language-model-pre12.271.386918.2241.51
video-text-modeling-with-zero-shot-transfer14.21.28-37.7
end-to-end-dense-video-captioning-with-masked4.380.3811.5527.44
coot-cooperative-hierarchical-transformer-for11.300.5719.8537.94