Visual Question Answering (VQA) on 3
Evaluation Metrics
Question Pair Acc
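This page does not define Question Pair Acc. The sketch below assumes one common reading: questions are grouped into pairs, and a pair counts as correct only if every question in it is answered correctly. The function name `question_pair_accuracy` and the `(pair_id, is_correct)` record layout are hypothetical, chosen for illustration, and may not match the benchmark's official scoring script.

```python
from collections import defaultdict


def question_pair_accuracy(records):
    """Return the percentage of question pairs answered entirely correctly.

    `records` is an iterable of (pair_id, is_correct) tuples.
    This layout is an assumption for illustration, not the benchmark's format.
    """
    # Collect the per-question outcomes belonging to each pair.
    outcomes_by_pair = defaultdict(list)
    for pair_id, is_correct in records:
        outcomes_by_pair[pair_id].append(bool(is_correct))

    if not outcomes_by_pair:
        return 0.0

    # A pair is solved only when all of its questions are correct.
    solved = sum(all(outcomes) for outcomes in outcomes_by_pair.values())
    return 100.0 * solved / len(outcomes_by_pair)


if __name__ == "__main__":
    demo = [("q1", True), ("q1", True), ("q2", True), ("q2", False)]
    print(question_pair_accuracy(demo))
```

Under this reading the score is stricter than per-question accuracy: in the demo, three of four answers are right, but only one of the two pairs is fully solved.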
Evaluation Results
Performance of each model on this benchmark
Model | Question Pair Acc | Paper Title | Repository |
---|---|---|---|
GPT-4V | - | HallusionBench: An Advanced Diagnostic Suite for Entangled Language Hallucination and Visual Illusion in Large Vision-Language Models | |
mPLUG-Owl | 2.36 | mPLUG-Owl: Modularization Empowers Large Language Models with Multimodality | |
LRV-Instruct | - | Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning | |
LLaVA-1.5 | - | - | - |