Home News Papers Tutorials Datasets Wiki SOTA LLM Models GPU Leaderboard Events

English

Visual Question Answering Vqa On 5

Metrics

Overall Accuracy

Results

Performance results of various models on this benchmark

Model Name	Overall Accuracy	Paper Title	Repository
GPT-4V	66.0	AUTOHALLUSION: Automatic Generation of Hallucination Benchmarks for Vision-Language Models
LLaVA-1.5	44.5	Improved Baselines with Visual Instruction Tuning
Claude 3	37.1	-	-
Gemini Pro Vision	51.4	-	-
miniGPT4	51.0	MiniGPT-4: Enhancing Vision-Language Understanding with Advanced Large Language Models

0 of 5 row(s) selected.