Video Question Answering
Benchmark List
All benchmarks related to this task
activitynet-qa
Best model: VideoChat2
Metrics
View Details
agqa-2-0-balanced
Best model: GF (sup) - Faster RCNN
Metrics
View Details
how2qa
Best model: Text + Text (no Multimodal Pretext Training)
Metrics
View Details
howto100m-qa
Best model: TimeSformer
Metrics
View Details
intentqa
Best model: VideoChat2_mistral
Metrics
View Details
ivqa
Best model: FrozenBiLM
Metrics
View Details
lsmdc-fib
Best model: Clover
Metrics
View Details
lsmdc-mc
Best model: VIOLETv2
Metrics
View Details
msr-vtt-mc
Best model: ATP (1<-16)
Metrics
View Details
msrvtt-mc
Best model: Singularity-temporal
Metrics
View Details
msrvtt-qa
Best model: FrozenBiLM
Metrics
View Details
mvbench
Best model: Tarsier (34B)
Metrics
View Details
next-qa
Best model: LinVT-Qwen2-VL
(7B)
Metrics
View Details
next-qa-efficient
Best model: ViLA (3B, 4 frames)
Metrics
View Details
perception-test
Best model: Oyrx (34B)
Metrics
View Details
roadtextvqa
Best model: GIT
Metrics
View Details
situated-reasoning-star
Best model: VLAP (4 frames)
Metrics
View Details
tvbench
Best model: Tarsier-34B
Metrics
View Details
tvqa
Best model: LLaMA-VQA
Metrics
View Details
videoqa
Best model: Just Ask (fine-tune)
Metrics
View Details
dramaqa
Metrics
View Details
msr-vtt
Metrics
View Details
msvd-qa
Metrics
View Details
trafficqa
Metrics
View Details
tgif-qa
Metrics
View Details
vlep
Metrics
View Details
wildqa
Metrics
View Details