Visual Question Answering (VQA) on 5

Overall Accuracy

Evaluation Results

Performance results for each model on this benchmark.

Model	Overall Accuracy	Paper Title
GPT-4V	66.0	AUTOHALLUSION: Automatic Generation of Hallucination Benchmarks for Vision-Language Models
Gemini Pro Vision	51.4	-
miniGPT4	51.0	MiniGPT-4: Enhancing Vision-Language Understanding with Advanced Large Language Models
LLaVA-1.5	44.5	Improved Baselines with Visual Instruction Tuning
Claude 3	37.1	-
