Visual Question Answering (VQA) on 3
Metrics
Question Pair Acc
Results
Performance results of various models on this benchmark
Model Name | Question Pair Acc | Paper Title | Repository |
---|---|---|---|
GPT-4V | - | HallusionBench: An Advanced Diagnostic Suite for Entangled Language Hallucination and Visual Illusion in Large Vision-Language Models | |
mPLUG-Owl | 2.36 | mPLUG-Owl: Modularization Empowers Large Language Models with Multimodality | |
LRV-Instruct | - | Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning | |
LLaVA-1.5 | - | - | - |