Visual Question Answering Vqa On Ai2D
المقاييس
EM
النتائج
نتائج أداء النماذج المختلفة على هذا المعيار القياسي
| Paper Title | ||
|---|---|---|
| SMoLA-PaLI-X Specialist Model | 82.5 | Omni-SMoLA: Boosting Generalist Multimodal Models with Soft Mixture of Low-rank Experts |
| SMoLA-PaLI-X Generalist Model | 81.4 | Omni-SMoLA: Boosting Generalist Multimodal Models with Soft Mixture of Low-rank Experts |
| Gemini Ultra | 79.5 | Gemini: A Family of Highly Capable Multimodal Models |
| DUBLIN | 51.11 | DUBLIN -- Document Understanding By Language-Image Network |
0 of 4 row(s) selected.