HyperAI

Long Context Understanding On Mmneedle

Métriques

1 Image, 2*2 Stitching, Exact Accuracy
1 Image, 4*4 Stitching, Exact Accuracy
1 Image, 8*8 Stitching, Exact Accuracy
10 Images, 1*1 Stitching, Exact Accuracy
10 Images, 2*2 Stitching, Exact Accuracy
10 Images, 4*4 Stitching, Exact Accuracy
10 Images, 8*8 Stitching, Exact Accuracy

Résultats

Résultats de performance de divers modèles sur ce benchmark

Tableau comparatif
Nom du modèle1 Image, 2*2 Stitching, Exact Accuracy1 Image, 4*4 Stitching, Exact Accuracy1 Image, 8*8 Stitching, Exact Accuracy10 Images, 1*1 Stitching, Exact Accuracy10 Images, 2*2 Stitching, Exact Accuracy10 Images, 4*4 Stitching, Exact Accuracy10 Images, 8*8 Stitching, Exact Accuracy
cogvlm-visual-expert-for-pretrained-language7.30.90.10000
instructblip-towards-general-purpose-vision0000000
instructblip-towards-general-purpose-vision3.86.22.20000
gpt-4-technical-report-186.0954.727.372.3634.247.580
what-matters-when-building-vision-language18.97.80.90000
gemini-a-family-of-highly-capable-multimodal-129.5324.782.1116.254.820.40
gemini-1-5-unlocking-multimodal-understanding90.3439.8529.8189.9445.216.090.62
cogvlm-visual-expert-for-pretrained-language00.10.30000
the-claude-3-model-family-opus-sonnet-haiku52.2512.31.666.934.60.40
mplug-owl2-revolutionizing-multi-modal-large1.90.30.70.40.100
llava-uhd-an-lmm-perceiving-any-aspect-ratio43.817.53.30000
gpt-4-technical-report-194.683199781.826.91