HyperAI초신경

Visual Question Answering On Benchlmm

평가 지표

GPT-3.5 score

평가 결과

이 벤치마크에서 각 모델의 성능 결과

비교 표
모델 이름GPT-3.5 score
minigpt-4-enhancing-vision-language34.93
instructblip-towards-general-purpose-vision44.63
improved-baselines-with-visual-instruction55.53
sphinx-the-joint-mixing-of-weights-tasks-and57.43
visual-instruction-tuning-146.83
instructblip-towards-general-purpose-vision45.03
minigpt-v2-large-language-model-as-a-unified30.1
gpt-4-technical-report-158.37
visual-instruction-tuning-143.50
otter-a-multi-modal-model-with-in-context39.13