HyperAI초신경

MMR Total on MRR Benchmark

Evaluation Metric

Total Column Score

Evaluation Results

Performance of each model on this benchmark

Comparison Table
Model Name                                       Total Column Score
internvl-scaling-up-vision-foundation-models     368
obelics-an-open-web-scale-filtered-dataset-of-1  139
what-matters-when-building-vision-language       256
internvl-scaling-up-vision-foundation-models     237
gpt-4o-visual-perception-performance-of          457
phi-3-technical-report-a-highly-capable          397
qwen-vl-a-frontier-large-vision-language         366
visual-instruction-tuning-1                      335
visual-instruction-tuning-1                      412
the-dawn-of-lmms-preliminary-explorations        415
claude-3-5-sonnet-model-card-addendum            463
monkey-image-resolution-and-text-label-are       214
qwen-vl-a-frontier-large-vision-language         310
visual-instruction-tuning-1                      243