Visual Reasoning On Bongard Openworld
Metriken
2-Class Accuracy
Ergebnisse
Leistungsergebnisse verschiedener Modelle zu diesem Benchmark
Vergleichstabelle
Modellname | 2-Class Accuracy |
---|---|
bongard-openworld-few-shot-reasoning-for-free | 91.0 |
bongard-openworld-few-shot-reasoning-for-free | 49.3 |
cognitive-paradigms-for-evaluating-vlms-on | 92.8 |
cognitive-paradigms-for-evaluating-vlms-on | 93.6 |
bongard-openworld-few-shot-reasoning-for-free | 63.3 |
bongard-openworld-few-shot-reasoning-for-free | 55.5 |
bongard-openworld-few-shot-reasoning-for-free | 64.0 |
bongard-openworld-few-shot-reasoning-for-free | 49.3 |
bongard-openworld-few-shot-reasoning-for-free | 63.8 |