Multimodal Reasoning On Rebus
Evaluation Metrics
Accuracy
Evaluation Results
Performance of each model on this benchmark
Comparison Table
Model Name | Accuracy (%) |
---|---|
rebus-a-robust-evaluation-benchmark-of-1 | 0.6 |
rebus-a-robust-evaluation-benchmark-of-1 | 0.9 |
rebus-a-robust-evaluation-benchmark-of-1 | 0.9 |
rebus-a-robust-evaluation-benchmark-of-1 | 1.8 |
rebus-a-robust-evaluation-benchmark-of-1 | 13.2 |
rebus-a-robust-evaluation-benchmark-of-1 | 1.5 |
rebus-a-robust-evaluation-benchmark-of-1 | 0.9 |
rebus-a-robust-evaluation-benchmark-of-1 | 24.0 |