HyperAI

Video Captioning on ActivityNet Captions

Metrics

BLEU4
CIDEr
METEOR

Results

Performance results of various models on this benchmark

| Model name | BLEU4 | CIDEr | METEOR | Paper Title | Repository |
|---|---|---|---|---|---|
| MART (ae-test split) - Appearance + Flow | 10.33 | 23.42 | 15.68 | MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning | |
| COOT (ae-test split) - Only Appearance features | 10.85 | 28.19 | 15.99 | COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning | |
| VLTinT (ae-test split) C3D/Ling | 14.5 | 31.13 | 17.97 | VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning | |
| VideoCoCa | 14.7 | 39.3 | - | VideoCoCa: Video-Text Modeling with Zero-Shot Transfer from Contrastive Captioners | - |
| VLCap (ae-test split) - Appearance + Language | 13.38 | 31.29 | 17.48 | VLCap: Vision-Language with Contrastive Learning for Coherent Video Paragraph Captioning | |