Video Captioning On Tvc
評価指標
BLEU-4
CIDEr
評価結果
このベンチマークにおける各モデルのパフォーマンス結果
モデル名 | BLEU-4 | CIDEr | Paper Title | Repository |
---|---|---|---|---|
VAST | 19.9 | 74.1 | VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset | |
COSA | 18.8 | 70.7 | COSA: Concatenated Sample Pretrained Vision-Language Foundation Model |
0 of 2 row(s) selected.