HyperAI초신경

Long Context Understanding On Mmneedle

평가 지표

1 Image, 2*2 Stitching, Exact Accuracy
1 Image, 4*4 Stitching, Exact Accuracy
1 Image, 8*8 Stitching, Exact Accuracy
10 Images, 1*1 Stitching, Exact Accuracy
10 Images, 2*2 Stitching, Exact Accuracy
10 Images, 4*4 Stitching, Exact Accuracy
10 Images, 8*8 Stitching, Exact Accuracy

평가 결과

이 벤치마크에서 각 모델의 성능 결과

비교 표
모델 이름1 Image, 2*2 Stitching, Exact Accuracy1 Image, 4*4 Stitching, Exact Accuracy1 Image, 8*8 Stitching, Exact Accuracy10 Images, 1*1 Stitching, Exact Accuracy10 Images, 2*2 Stitching, Exact Accuracy10 Images, 4*4 Stitching, Exact Accuracy10 Images, 8*8 Stitching, Exact Accuracy
cogvlm-visual-expert-for-pretrained-language7.30.90.10000
instructblip-towards-general-purpose-vision0000000
instructblip-towards-general-purpose-vision3.86.22.20000
gpt-4-technical-report-186.0954.727.372.3634.247.580
what-matters-when-building-vision-language18.97.80.90000
gemini-a-family-of-highly-capable-multimodal-129.5324.782.1116.254.820.40
gemini-1-5-unlocking-multimodal-understanding90.3439.8529.8189.9445.216.090.62
cogvlm-visual-expert-for-pretrained-language00.10.30000
the-claude-3-model-family-opus-sonnet-haiku52.2512.31.666.934.60.40
mplug-owl2-revolutionizing-multi-modal-large1.90.30.70.40.100
llava-uhd-an-lmm-perceiving-any-aspect-ratio43.817.53.30000
gpt-4-technical-report-194.683199781.826.91