Video Question Answering On Sutd Trafficqa
Métriques
1/4
Résultats
Résultats de performance de divers modèles sur ce benchmark
Nom du modèle | 1/4 | Paper Title | Repository |
---|---|---|---|
CFMMC-Align | 50.2 | - | - |
VIS+LST | 29.91 | Exploring Models and Data for Image Question Answering | |
HCRN | 36.49 | Hierarchical Conditional Relation Networks for Video Question Answering | |
Tem-adapter | 46.0 | Tem-adapter: Adapting Image-Text Pretraining for Video Question Answer | |
Eclipse | 37.05 | SUTD-TrafficQA: A Question Answering Benchmark and an Efficient Network for Video Reasoning over Traffic Events | |
TVQA | 35.16 | TVQA: Localized, Compositional Video Question Answering |
0 of 6 row(s) selected.