Video Captioning on MSRVTT-CTN
Metrics
CIDEr
ROUGE-L
SPICE
Results
Performance results of various models on this benchmark
Model Name | CIDEr | ROUGE-L | SPICE | Paper Title | Repository |
---|---|---|---|---|---|
CEN | 49.87 | 27.90 | 15.76 | NarrativeBridge: Enhancing Video Captioning with Causal-Temporal Narrative | - |
SEM-POS | 26.01 | 20.11 | 12.09 | SEM-POS: Grammatically and Semantically Correct Video Captioning | - |
AKGNN | 25.90 | 21.42 | 11.99 | Action knowledge for video captioning with graph neural networks | - |
GIT | 32.43 | 24.51 | 13.70 | GiT: Towards Generalist Vision Transformer through Universal Language Interface | - |