HyperAI超神経

Zero-Shot Video Question Answer on Video-MME 1

Evaluation Metrics

Accuracy (%)

Evaluation Results

Performance results of each model on this benchmark

Comparison Table
Model Name                                     Accuracy (%)
gpt-4o-visual-perception-performance-of        68.9
videollama-2-advancing-spatial-temporal        63.1
bimba-selective-scan-compression-for-long      64.67
video-rag-visually-aligned-retrieval           77.4
vila-on-pre-training-for-visual-language       64.1
gemini-1-5-unlocking-multimodal-understanding  81.3
longvu-spatiotemporal-adaptive-compression     60.6
2408-01800                                     63.7
gemini-1-5-unlocking-multimodal-understanding  75.0
gpt-4o-visual-perception-performance-of        77.2