Visual Instruction Following On Llava Bench
評価指標
avg score
評価結果
このベンチマークにおける各モデルのパフォーマンス結果
比較表
モデル名 | avg score |
---|---|
sharegpt4v-improving-large-multi-modal-models | 79.9 |
cumo-scaling-multimodal-llm-with-co-upcycled | 85.7 |
improved-baselines-with-visual-instruction | 70.7 |
instructblip-towards-general-purpose-vision | 58.2 |
blip-2-bootstrapping-language-image-pre | 38.1 |
sharegpt4v-improving-large-multi-modal-models | 72.6 |
instructblip-towards-general-purpose-vision | 60.9 |
improved-baselines-with-visual-instruction | 63.4 |