HyperAI超神经

Multimodal Reasoning On Math V

Evaluation Metrics

Accuracy

Evaluation Results

Performance of each model on this benchmark

Comparison Table

Model Name                                     Accuracy
measuring-multimodal-mathematical-reasoning    14.54
measuring-multimodal-mathematical-reasoning    17.66
measuring-multimodal-mathematical-reasoning    15.59
measuring-multimodal-mathematical-reasoning    22.76