HyperAI超神経

Visual Instruction Following on LLaVA-Bench

Evaluation Metrics

avg score

Evaluation Results

Performance results of each model on this benchmark

Comparison Table
Model name                                      avg score
sharegpt4v-improving-large-multi-modal-models   79.9
cumo-scaling-multimodal-llm-with-co-upcycled    85.7
improved-baselines-with-visual-instruction      70.7
instructblip-towards-general-purpose-vision     58.2
blip-2-bootstrapping-language-image-pre         38.1
sharegpt4v-improving-large-multi-modal-models   72.6
instructblip-towards-general-purpose-vision     60.9
improved-baselines-with-visual-instruction      63.4