HyperAI

Visual Reasoning On Bongard Openworld

Metriken

2-Class Accuracy

Ergebnisse

Leistungsergebnisse verschiedener Modelle zu diesem Benchmark

Vergleichstabelle
Modellname2-Class Accuracy
bongard-openworld-few-shot-reasoning-for-free91.0
bongard-openworld-few-shot-reasoning-for-free49.3
cognitive-paradigms-for-evaluating-vlms-on92.8
cognitive-paradigms-for-evaluating-vlms-on93.6
bongard-openworld-few-shot-reasoning-for-free63.3
bongard-openworld-few-shot-reasoning-for-free55.5
bongard-openworld-few-shot-reasoning-for-free64.0
bongard-openworld-few-shot-reasoning-for-free49.3
bongard-openworld-few-shot-reasoning-for-free63.8