Visual Question Answering (VQA) on 3
Metrics
Question Pair Acc
Results
Performance of different models on this benchmark
Model Name | Question Pair Acc | Paper Title | Repository |
---|---|---|---|
GPT-4V | - | HallusionBench: An Advanced Diagnostic Suite for Entangled Language Hallucination and Visual Illusion in Large Vision-Language Models | |
mPLUG-Owl | 2.36 | mPLUG-Owl: Modularization Empowers Large Language Models with Multimodality | |
LRV-Instruct | - | Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning | |
LLaVA-1.5 | - | - | - |