Visual Reasoning On Bongard Openworld
評価指標
2-Class Accuracy
評価結果
このベンチマークにおける各モデルのパフォーマンス結果
比較表
モデル名 | 2-Class Accuracy |
---|---|
bongard-openworld-few-shot-reasoning-for-free | 91.0 |
bongard-openworld-few-shot-reasoning-for-free | 49.3 |
cognitive-paradigms-for-evaluating-vlms-on | 92.8 |
cognitive-paradigms-for-evaluating-vlms-on | 93.6 |
bongard-openworld-few-shot-reasoning-for-free | 63.3 |
bongard-openworld-few-shot-reasoning-for-free | 55.5 |
bongard-openworld-few-shot-reasoning-for-free | 64.0 |
bongard-openworld-few-shot-reasoning-for-free | 49.3 |
bongard-openworld-few-shot-reasoning-for-free | 63.8 |