Video Question Answering On Perception Test

Metrics

Accuracy (Top-1)
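
As a rough illustration of the metric, Top-1 accuracy is the share of questions for which the model's single highest-ranked answer matches the ground truth. The sketch below assumes a multiple-choice setup and a percentage scale. The page does not state how the leaderboard computes its scores, so the function name `top1_accuracy` and the toy data are hypothetical.

```python
from typing import Sequence


def top1_accuracy(predicted: Sequence[int], correct: Sequence[int]) -> float:
    """Return the percentage of questions whose top-ranked answer is correct.

    Assumption: each entry is the index of the chosen (or ground-truth)
    option for one multiple-choice question.
    """
    if len(predicted) != len(correct):
        raise ValueError("predicted and correct must have the same length")
    if not predicted:
        raise ValueError("at least one question is required")
    # Count the questions where the prediction equals the ground truth.
    hits = sum(p == c for p, c in zip(predicted, correct))
    return 100.0 * hits / len(predicted)


# Hypothetical toy data: chosen option per question vs. ground truth.
predictions = [0, 2, 1, 1]
answers = [0, 2, 2, 1]
print(f"{top1_accuracy(predictions, answers):.1f}")  # 3 of 4 correct -> 75.0
```

Returning a percentage rather than a fraction is a presentation choice made here to match the scale of most scores in the table.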

Results

Performance results of various models on this benchmark

Comparison Table
Model Name                                    Accuracy (Top-1)
traveler-a-multi-lmm-agent-framework-for      50.2
videollama-2-advancing-spatial-temporal       57.5
internvideo2-scaling-video-foundation-models  63.4
bimba-selective-scan-compression-for-long     68.51
oryx-mllm-on-demand-spatial-temporal          71.4
perception-test-a-diagnostic-benchmark-for-2  0.46