
Visual Question Answering on IconQA

Evaluation Metrics

Reasoning (Alg.)
Reasoning (Com.)
Reasoning (Cou.)
Reasoning (Est.)
Reasoning (Fra.)
Reasoning (Geo.)
Reasoning (Mea.)
Reasoning (Pat.)
Reasoning (Pro.)
Reasoning (Sce.)
Reasoning (Sen.)
Reasoning (Spa.)
Reasoning (Tim.)
Sub-tasks (Blank)
Sub-tasks (Img.)
Sub-tasks (Txt.)

Evaluation Results

Performance results of each model on this benchmark

Comparison Table
| Model Name | Reasoning (Alg.) | Reasoning (Com.) | Reasoning (Cou.) | Reasoning (Est.) | Reasoning (Fra.) | Reasoning (Geo.) | Reasoning (Mea.) | Reasoning (Pat.) | Reasoning (Pro.) | Reasoning (Sce.) | Reasoning (Sen.) | Reasoning (Spa.) | Reasoning (Tim.) | Sub-tasks (Blank) | Sub-tasks (Img.) | Sub-tasks (Txt.) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| iconqa-a-new-benchmark-for-abstract-diagram | 50.27 | 81.69 | 70.68 | 99.02 | 77.60 | 81.80 | 98.83 | 56.60 | 85.70 | 67.01 | 84.11 | 51.42 | 67.72 | 78.28 | 77.72 | 72.17 |
| iconqa-a-new-benchmark-for-abstract-diagram | 28.02 | 48.19 | 33.63 | 40.46 | 33.06 | 38.03 | 38.07 | 33.66 | 40.76 | 35.37 | 45.25 | 37.14 | 48.09 | 28.45 | 41.64 | 36.86 |
| iconqa-a-new-benchmark-for-abstract-diagram | 50.62 | 75.60 | 71.05 | 99.22 | 74.09 | 80.05 | 99.07 | 62.78 | 70.94 | 58.52 | 81.78 | 49.46 | 66.72 | 77.08 | 76.66 | 70.47 |
| iconqa-a-new-benchmark-for-abstract-diagram | 31.73 | 45.26 | 37.64 | 62.29 | 32.48 | 38.71 | 64.02 | 36.29 | 37.51 | 35.47 | 45.25 | 37.52 | 47.37 | 46.65 | 41.56 | 36.02 |
| iconqa-a-new-benchmark-for-abstract-diagram | 11.12 | 41.20 | 18.38 | 3.62 | 34.84 | 30.30 | 0.36 | 34.81 | 38.81 | 34.25 | 45.16 | 36.49 | 35.82 | 0.29 | 41.70 | 36.87 |
| iconqa-a-new-benchmark-for-abstract-diagram | 50.55 | 84.95 | 71.13 | 99.02 | 75.81 | 82.61 | 98.91 | 59.22 | 87.65 | 66.72 | 86.10 | 53.38 | 69.99 | 79.27 | 79.67 | 72.69 |
| iconqa-a-new-benchmark-for-abstract-diagram | 49.18 | 83.67 | 71.01 | 99.41 | 78.37 | 81.31 | 99.38 | 60.81 | 87.84 | 61.25 | 86.10 | 48.34 | 69.77 | 78.53 | 78.71 | 72.39 |
| iconqa-a-new-benchmark-for-abstract-diagram | 47.32 | 82.73 | 68.94 | 99.08 | 76.20 | 79.86 | 98.99 | 54.79 | 84.87 | 62.49 | 83.25 | 49.70 | 68.00 | 74.52 | 77.36 | 71.25 |
| iconqa-a-new-benchmark-for-abstract-diagram | 47.46 | 82.12 | 67.56 | 97.06 | 73.77 | 79.99 | 96.50 | 55.67 | 82.45 | 66.92 | 82.12 | 53.20 | 66.50 | 75.54 | 76.33 | 70.82 |
| iconqa-a-new-benchmark-for-abstract-diagram | 56.73 | 87.00 | 77.81 | 98.24 | 82.13 | 81.87 | 97.98 | 68.75 | 95.73 | 62.39 | 92.49 | 55.62 | 77.98 | 83.62 | 82.66 | 75.19 |
| iconqa-a-new-benchmark-for-abstract-diagram | 51.10 | 82.12 | 70.84 | 98.95 | 77.41 | 82.60 | 98.76 | 58.46 | 86.07 | 68.80 | 84.72 | 54.64 | 68.66 | 78.92 | 79.15 | 72.34 |
| iconqa-a-new-benchmark-for-abstract-diagram | 50.00 | 80.65 | 65.01 | 99.54 | 72.43 | 80.07 | 99.46 | 55.01 | 83.75 | 58.22 | 84.54 | 45.78 | 68.28 | 73.03 | 75.92 | 68.51 |