HyperAI

Video Based Generative Performance 5

Metrics

gpt-score

Results

Performance results of various models on this benchmark

Comparison Table
Model Namegpt-score
ppllava-varied-video-sequence-understanding3.21
videochat-chat-centric-video-understanding1.94
mvbench-a-comprehensive-multi-modal-video2.66
slowfast-llava-a-strong-training-free2.77
llama-adapter-v2-parameter-efficient-visual1.98
pllava-parameter-free-llava-extension-from-12.67
one-for-all-video-conversation-is-feasible2.13
video-llama-an-instruction-tuned-audio-visual1.82
ts-llava-constructing-visual-tokens-through2.77
moviechat-from-dense-token-to-sparse-memory2.24
vtimellm-empower-llm-to-grasp-video-moments2.49
videogpt-integrating-image-and-video-encoders2.83
chat-univi-unified-visual-representation2.39
one-for-all-video-conversation-is-feasible2.34
st-llm-large-language-models-are-effective-12.93
minigpt4-video-advancing-multimodal-llms-for2.65
video-chatgpt-towards-detailed-video1.98
mvbench-a-comprehensive-multi-modal-video2.65