Video Question Answering on IntentQA
Evaluation Metrics
Accuracy
CH
CW
TP&TN
Evaluation Results
Performance of each model on this benchmark
| Model Name | Accuracy | CH | CW | TP&TN | Paper Title | Repository |
|---|---|---|---|---|---|---|
| VideoChat2_mistral | 81.9 | 86.9 | 82.6 | 77.0 | MVBench: A Comprehensive Multi-modal Video Understanding Benchmark | |
| HQGA | 47.7 | 54.3 | 48.2 | 41.7 | Video as Conditional Graph Hierarchy for Multi-Granular Question Answering | |
| VGT | 51.3 | 56.0 | 51.4 | 47.6 | Video Graph Transformer for Video Question Answering | |
| IntentQA | 57.6 | 65.5 | 58.4 | 50.5 | IntentQA: Context-aware Video Intent Reasoning | - |
| VideoChat2_HD_mistral | 83.4 | 90.0 | 84.0 | 77.3 | MVBench: A Comprehensive Multi-modal Video Understanding Benchmark | |
| Human | 78.5 | 80.2 | 77.8 | 79.1 | IntentQA: Context-aware Video Intent Reasoning | - |