Visual Question Answering Vqa On 3

Question Pair Acc

評価結果

このベンチマークにおける各モデルのパフォーマンス結果

モデル名	Question Pair Acc	Paper Title	Repository
GPT-4V	-	HallusionBench: An Advanced Diagnostic Suite for Entangled Language Hallucination and Visual Illusion in Large Vision-Language Models
mPLUG-Owl	2.36	mPLUG-Owl: Modularization Empowers Large Language Models with Multimodality
LRV-Instruct	-	Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning
LLaVA-1.5	-	-	-

0 of 4 row(s) selected.