HyperAI

Visual Reasoning On Bongard Openworld

Metrics

2-Class Accuracy
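
A minimal sketch of how such a score could be computed, assuming 2-Class Accuracy means the percentage of query images correctly assigned to the positive or negative class. The function name `two_class_accuracy` and the sample data are hypothetical and are not taken from the benchmark's evaluation code.

```python
def two_class_accuracy(predictions, labels):
    """Return the percentage of queries whose predicted class matches the label."""
    if len(predictions) != len(labels):
        raise ValueError("predictions and labels must have the same length")
    if not labels:
        raise ValueError("at least one query is required")
    # Count the queries where the predicted class equals the ground-truth class.
    correct = sum(p == y for p, y in zip(predictions, labels))
    return 100.0 * correct / len(labels)


# Hypothetical example: four query images, three classified correctly.
labels = ["positive", "negative", "positive", "negative"]
predictions = ["positive", "negative", "negative", "negative"]
print(two_class_accuracy(predictions, labels))  # 75.0
```

Under this reading, a model that guesses at random on a balanced set of queries would score about 50, which gives a baseline for interpreting the results.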

Results

Reported 2-Class Accuracy of the models evaluated on the Bongard-OpenWorld benchmark

Comparison Table
Model Name                                       2-Class Accuracy
bongard-openworld-few-shot-reasoning-for-free    91.0
bongard-openworld-few-shot-reasoning-for-free    49.3
cognitive-paradigms-for-evaluating-vlms-on       92.8
cognitive-paradigms-for-evaluating-vlms-on       93.6
bongard-openworld-few-shot-reasoning-for-free    63.3
bongard-openworld-few-shot-reasoning-for-free    55.5
bongard-openworld-few-shot-reasoning-for-free    64.0
bongard-openworld-few-shot-reasoning-for-free    49.3
bongard-openworld-few-shot-reasoning-for-free    63.8