Video Question Answering on Perception Test
Metrics
Accuracy (Top-1)
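As a minimal sketch of how this metric is commonly computed, assuming each question has exactly one correct option and the model commits to a single answer per question: top-1 accuracy is the share of questions answered correctly, expressed here as a percentage. The function name `top1_accuracy` and the sample answers are illustrative and are not taken from the benchmark's own evaluation code.

```python
def top1_accuracy(predictions, answers):
    """Return the percentage of predictions that exactly match the answers."""
    if len(predictions) != len(answers):
        raise ValueError("predictions and answers must have the same length")
    if not answers:
        raise ValueError("at least one question is required")
    # Count questions where the single predicted option equals the correct one.
    correct = sum(p == a for p, a in zip(predictions, answers))
    return 100.0 * correct / len(answers)


# Hypothetical option indices for four multiple-choice questions.
predicted = [0, 2, 1, 1]
expected = [0, 2, 2, 1]
score = top1_accuracy(predicted, expected)  # 3 of 4 answers match
```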
Results
Top-1 accuracy reported for each model on this benchmark.
Comparison Table
Model Name | Accuracy (Top-1) |
---|---|
traveler-a-multi-lmm-agent-framework-for | 50.2 |
videollama-2-advancing-spatial-temporal | 57.5 |
internvideo2-scaling-video-foundation-models | 63.4 |
bimba-selective-scan-compression-for-long | 68.51 |
oryx-mllm-on-demand-spatial-temporal | 71.4 |
perception-test-a-diagnostic-benchmark-for-2 | 0.46 |
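The value in the last row (0.46) is roughly two orders of magnitude below the others, which suggests it may be reported as a fraction rather than a percentage; the source does not confirm this. Below is a minimal sketch, under that assumption, of putting the rows on one scale before ranking them. The helper `to_percent` and its threshold of 1.0 are hypothetical choices made for illustration.

```python
# Rows copied from the table above: (model slug, reported accuracy).
rows = [
    ("traveler-a-multi-lmm-agent-framework-for", 50.2),
    ("videollama-2-advancing-spatial-temporal", 57.5),
    ("internvideo2-scaling-video-foundation-models", 63.4),
    ("bimba-selective-scan-compression-for-long", 68.51),
    ("oryx-mllm-on-demand-spatial-temporal", 71.4),
    ("perception-test-a-diagnostic-benchmark-for-2", 0.46),
]


def to_percent(value, threshold=1.0):
    """Assumption: values at or below `threshold` are fractions in [0, 1]."""
    return value * 100.0 if value <= threshold else value


# Normalize every row to a percentage, then sort from best to worst.
normalized = [(name, round(to_percent(acc), 2)) for name, acc in rows]
ranked = sorted(normalized, key=lambda row: row[1], reverse=True)

for name, acc in ranked:
    print(f"{name}: {acc:.2f}")
```

If the 0.46 entry is in fact a percentage, the threshold heuristic would misplace it, so the reported scale should be checked against the original source before relying on the ranking.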