HyperAI

Video Question Answering On Agqa 2 0 Balanced

Metrics

Average Accuracy

Results

Performance results of various models on this benchmark

Comparison Table
Model NameAverage Accuracy
mist-multi-modal-iterative-spatial-temporal50.96
glance-and-focus-memory-prompting-for-multi-153.33
mist-multi-modal-iterative-spatial-temporal54.39
learning-situation-hyper-graphs-for-video49.2
glance-and-focus-memory-prompting-for-multi-148.59
mmtf-multi-modal-temporal-fusion-for44.36
svitt-temporal-learning-of-sparse-video-text52.7
glance-and-focus-memory-prompting-for-multi-155.08