Visual Question Answering Vqa On Imagenet

평가 지표

ClipMatch@1
ClipMatch@5
Contains
ExactMatch
Follow-up ClipMatch@1
Follow-up ClipMatch@5
Follow-up Contains
Follow-up ExactMatch

평가 결과

이 벤치마크에서 각 모델의 성능 결과

모델 이름
ClipMatch@1
ClipMatch@5
Contains
ExactMatch
Follow-up ClipMatch@1
Follow-up ClipMatch@5
Follow-up Contains
Follow-up ExactMatch
Paper TitleRepository
BLIP-2 OPT57.1077.2435.490.8767.2283.5440.312.54Open-ended VQA benchmarking of Vision-Language models by exploiting Classification datasets and their semantic hierarchy
0 of 1 row(s) selected.
Visual Question Answering Vqa On Imagenet | SOTA | HyperAI초신경