HyperAI

Zeroshot Video Question Answer On Tgif Qa

Metriken

Accuracy
Confidence Score

Ergebnisse

Leistungsergebnisse verschiedener Modelle zu diesem Benchmark

Vergleichstabelle
ModellnameAccuracyConfidence Score
elysium-exploring-object-level-perception-in66.63.6
minigpt4-video-advancing-multimodal-llms-for72.22-
ts-llava-constructing-visual-tokens-through81.04.2
videochat-chat-centric-video-understanding34.42.3
slowfast-llava-a-strong-training-free80.64.3
video-llava-learning-united-visual-170.04.0
videogpt-integrating-image-and-video-encoders74.64.1
linvt-empower-your-image-level-large-language81.34.3
an-image-grid-can-be-worth-a-video-zero-shot79.14.2
video-chatgpt-towards-detailed-video51.43.0
pllava-parameter-free-llava-extension-from-180.64.3
chat-univi-unified-visual-representation69.03.8
zero-shot-video-question-answering-via-frozen41.9-
tarsier-recipes-for-training-and-evaluating-182.54.4