Video Captioning On Tvc

BLEU-4

CIDEr

평가 결과

이 벤치마크에서 각 모델의 성능 결과

모델 이름	BLEU-4	CIDEr	Paper Title	Repository
VAST	19.9	74.1	VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset
COSA	18.8	70.7	COSA: Concatenated Sample Pretrained Vision-Language Foundation Model

0 of 2 row(s) selected.