HyperAI

Visual Reasoning On Bongard Openworld

Métriques

2-Class Accuracy

Résultats

Résultats de performance de divers modèles sur ce benchmark

Tableau comparatif
Nom du modèle2-Class Accuracy
bongard-openworld-few-shot-reasoning-for-free91.0
bongard-openworld-few-shot-reasoning-for-free49.3
cognitive-paradigms-for-evaluating-vlms-on92.8
cognitive-paradigms-for-evaluating-vlms-on93.6
bongard-openworld-few-shot-reasoning-for-free63.3
bongard-openworld-few-shot-reasoning-for-free55.5
bongard-openworld-few-shot-reasoning-for-free64.0
bongard-openworld-few-shot-reasoning-for-free49.3
bongard-openworld-few-shot-reasoning-for-free63.8