HyperAI超神経

Visual Reasoning On Bongard Openworld

評価指標

2-Class Accuracy

評価結果

このベンチマークにおける各モデルのパフォーマンス結果

比較表
モデル名2-Class Accuracy
bongard-openworld-few-shot-reasoning-for-free91.0
bongard-openworld-few-shot-reasoning-for-free49.3
cognitive-paradigms-for-evaluating-vlms-on92.8
cognitive-paradigms-for-evaluating-vlms-on93.6
bongard-openworld-few-shot-reasoning-for-free63.3
bongard-openworld-few-shot-reasoning-for-free55.5
bongard-openworld-few-shot-reasoning-for-free64.0
bongard-openworld-few-shot-reasoning-for-free49.3
bongard-openworld-few-shot-reasoning-for-free63.8