HyperAI超神经

Multimodal Reasoning On Rebus

评估指标

Accuracy

评测结果

各个模型在此基准测试上的表现结果

比较表格
模型名称Accuracy
rebus-a-robust-evaluation-benchmark-of-10.6
rebus-a-robust-evaluation-benchmark-of-10.9
rebus-a-robust-evaluation-benchmark-of-10.9
rebus-a-robust-evaluation-benchmark-of-11.8
rebus-a-robust-evaluation-benchmark-of-113.2
rebus-a-robust-evaluation-benchmark-of-11.5
rebus-a-robust-evaluation-benchmark-of-10.9
rebus-a-robust-evaluation-benchmark-of-124.0