Visual Question Answering Vqa On 3
評価指標
Question Pair Acc
評価結果
このベンチマークにおける各モデルのパフォーマンス結果
モデル名 | Question Pair Acc | Paper Title | Repository |
---|---|---|---|
GPT-4V | - | HallusionBench: An Advanced Diagnostic Suite for Entangled Language Hallucination and Visual Illusion in Large Vision-Language Models | |
mPLUG-Owl | 2.36 | mPLUG-Owl: Modularization Empowers Large Language Models with Multimodality | |
LRV-Instruct | - | Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning | |
LLaVA-1.5 | - | - | - |
0 of 4 row(s) selected.