
Visual Question Answering (VQA) on ImageNet

Metrics

ClipMatch@1
ClipMatch@5
Contains
ExactMatch
Follow-up ClipMatch@1
Follow-up ClipMatch@5
Follow-up Contains
Follow-up ExactMatch
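
The two text-based metrics above can be illustrated with a minimal sketch. It assumes that ExactMatch counts a prediction as correct when it equals the class label after normalization, and that Contains counts it as correct when the label appears anywhere in the prediction. The normalization steps and the helper names (`normalize`, `exact_match`, `contains`, `accuracy`) are hypothetical and are not taken from the benchmark's code. ClipMatch is left out because it needs a CLIP text encoder.

```python
import string


def normalize(text: str) -> str:
    """Lowercase, drop ASCII punctuation, collapse whitespace (assumed normalization)."""
    text = text.lower().translate(str.maketrans("", "", string.punctuation))
    return " ".join(text.split())


def exact_match(prediction: str, label: str) -> bool:
    """True when the normalized prediction is identical to the normalized label."""
    return normalize(prediction) == normalize(label)


def contains(prediction: str, label: str) -> bool:
    """True when the normalized label is a substring of the normalized prediction.

    Plain substring matching is a simplification: "cat" also matches "category".
    """
    return normalize(label) in normalize(prediction)


def accuracy(predictions, labels, metric) -> float:
    """Percentage of (prediction, label) pairs accepted by the given metric."""
    hits = sum(metric(p, l) for p, l in zip(predictions, labels))
    return 100.0 * hits / len(labels)


# Toy data, invented for illustration only.
predictions = ["A golden retriever.", "dog", "tabby cat"]
labels = ["golden retriever", "dog", "tiger cat"]

em_score = accuracy(predictions, labels, exact_match)
contains_score = accuracy(predictions, labels, contains)
print(f"ExactMatch: {em_score:.2f}  Contains: {contains_score:.2f}")
```

Under these assumptions Contains can never score lower than ExactMatch, which agrees with the ordering of the two columns in the results on this page.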

Results

Performance results of various models on this benchmark

| Model name | ClipMatch@1 | ClipMatch@5 | Contains | ExactMatch | Follow-up ClipMatch@1 | Follow-up ClipMatch@5 | Follow-up Contains | Follow-up ExactMatch | Paper Title |
|---|---|---|---|---|---|---|---|---|---|
| BLIP-2 OPT | 57.10 | 77.24 | 35.49 | 0.87 | 67.22 | 83.54 | 40.31 | 2.54 | Open-ended VQA benchmarking of Vision-Language models by exploiting Classification datasets and their semantic hierarchy |