Visual Reasoning on Bongard-OpenWorld
Metrics
2-Class Accuracy
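As a rough illustration of the metric, the sketch below assumes that 2-class accuracy is the percentage of binary (positive vs. negative) query predictions that match the ground truth. The function name `two_class_accuracy` and the sample labels are hypothetical and are not taken from the benchmark's own evaluation code.

```python
from typing import Sequence


def two_class_accuracy(predictions: Sequence[int], labels: Sequence[int]) -> float:
    """Return the percentage of binary predictions that equal the labels.

    Assumption: each query image is labeled 1 (positive) or 0 (negative),
    and the score is reported on a 0-100 scale, as in the table values.
    """
    if len(predictions) != len(labels):
        raise ValueError("predictions and labels must have the same length")
    if len(labels) == 0:
        raise ValueError("at least one example is required")
    # Count positions where the predicted class matches the true class.
    correct = sum(p == y for p, y in zip(predictions, labels))
    return 100.0 * correct / len(labels)


# Hypothetical data: three of the four predictions are correct.
example_predictions = [1, 0, 1, 1]
example_labels = [1, 0, 0, 1]
print(two_class_accuracy(example_predictions, example_labels))  # 75.0
```

Under this reading, a model that guesses at random on a balanced two-class task would score around 50, which gives a reference point for the values in the table.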
Results
Performance results of various models on this benchmark
Comparison table
Model name | 2-Class Accuracy (%) |
---|---|
bongard-openworld-few-shot-reasoning-for-free | 91.0 |
bongard-openworld-few-shot-reasoning-for-free | 49.3 |
cognitive-paradigms-for-evaluating-vlms-on | 92.8 |
cognitive-paradigms-for-evaluating-vlms-on | 93.6 |
bongard-openworld-few-shot-reasoning-for-free | 63.3 |
bongard-openworld-few-shot-reasoning-for-free | 55.5 |
bongard-openworld-few-shot-reasoning-for-free | 64.0 |
bongard-openworld-few-shot-reasoning-for-free | 49.3 |
bongard-openworld-few-shot-reasoning-for-free | 63.8 |