HyperAI超神経

Video Captioning On Msr Vtt 1

評価指標

BLEU-4
CIDEr
METEOR
ROUGE-L

評価結果

このベンチマークにおける各モデルのパフォーマンス結果

比較表
モデル名BLEU-4CIDErMETEORROUGE-L
howtocaption-prompting-llms-to-transform49.865.332.266.3
mplug-2-a-modularized-multi-modal-foundation57.880.034.970.1
vid2seq-large-scale-pretraining-of-a-visual-64.630.8-
video-text-modeling-with-zero-shot-transfer53.873.2-68.0
meltr-meta-loss-transformer-for-learning-to44.1752.7729.2662.35
vast-a-vision-audio-subtitle-text-omni-156.778.0--
hitea-hierarchical-temporal-aware-video49.265.130.765.0
text-with-knowledge-graph-augmented46.660.830.564.8
cosa-concatenated-sample-pretrained-vision53.774.7--
an-empirical-study-of-end-to-end-video-58--
accurate-and-fast-compressed-video-captioning44.457.230.363.4
icocap-improving-video-captioning-by47.060.231.164.9
vlab-enhancing-video-language-pre-training-by54.674.933.468.3
sem-pos-grammatically-and-semantically45.253.130.764.1
clip-meets-video-captioners-attribute-aware48.258.731.364.8
icocap-improving-video-captioning-by46.159.130.364.3
git-a-generative-image-to-text-transformer54.875.933.168.2
mammut-a-simple-architecture-for-joint-73.6--
rtq-rethinking-video-language-understanding49.669.3-66.1
expectation-maximization-contrastive-learning45.354.630.263.2
valor-vision-audio-language-omni-perception54.474.032.968.0
end-to-end-generative-pretraining-for48.960.038.764.0
diverse-video-captioning-by-adaptive-spatio44.2156.0830.2462.9
diverse-video-captioning-by-adaptive-spatio43.45530.262.5