HyperAI超神経

Video Based Generative Performance 3

評価指標

gpt-score

評価結果

このベンチマークにおける各モデルのパフォーマンス結果

比較表
モデル名gpt-score
chat-univi-unified-visual-representation3.46
ts-llava-constructing-visual-tokens-through3.86
videogpt-integrating-image-and-video-encoders3.74
mvbench-a-comprehensive-multi-modal-video3.64
pllava-parameter-free-llava-extension-from-13.9
video-llama-an-instruction-tuned-audio-visual2.16
video-chatgpt-towards-detailed-video2.62
llama-adapter-v2-parameter-efficient-visual2.30
vtimellm-empower-llm-to-grasp-video-moments3.40
slowfast-llava-a-strong-training-free3.84
st-llm-large-language-models-are-effective-13.74
videochat-chat-centric-video-understanding2.53
one-for-all-video-conversation-is-feasible3.27
minigpt4-video-advancing-multimodal-llms-for3.57
moviechat-from-dense-token-to-sparse-memory3.01
ppllava-varied-video-sequence-understanding4.21
mvbench-a-comprehensive-multi-modal-video3.51
one-for-all-video-conversation-is-feasible2.89