HyperAI초신경

Visual Reasoning On Nlvr2 Test

평가 지표

Accuracy

평가 결과

이 벤치마크에서 각 모델의 성능 결과

비교 표
모델 이름Accuracy
coca-contrastive-captioners-are-image-text87.0
uniter-learning-universal-image-text-179.5
simvlm-simple-visual-language-model85.15
vlmo-unified-vision-language-pre-training86.86
blip-bootstrapping-language-image-pre83.09
x-2-vlm-all-in-one-pre-trained-model-for89.4
x-2-vlm-all-in-one-pre-trained-model-for87.0
multi-grained-vision-language-pre-training84.76
seeing-out-of-the-box-end-to-end-pre-training77.32
lxmert-learning-cross-modality-encoder76.2
vilt-vision-and-language-transformer-without76.13
align-before-fuse-vision-and-language82.55
image-as-a-foreign-language-beit-pretraining92.58
toward-building-general-foundation-models-for88.4