Visual Question Answering (VQA) on PMC-VQA
Evaluation Metric
Accuracy
Results
Performance of each model on this benchmark
Model | Accuracy (%) | Paper Title | Repository |
---|---|---|---|
BLIP-2 | 24.3 | BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models | |
Open-Flamingo | 26.4 | Flamingo: a Visual Language Model for Few-Shot Learning | |
PMC-CLIP | 24.7 | PMC-CLIP: Contrastive Language-Image Pre-training using Biomedical Documents | |
MedVInT | 42.3 | PMC-VQA: Visual Instruction Tuning for Medical Visual Question Answering | |