Mathematical Reasoning on PGPS9K
Evaluation Metrics
Completion accuracy
Evaluation Results
Performance of each model on this benchmark
Comparison Table
Model Name | Completion accuracy |
---|---|
a-multi-modal-neural-geometric-solver-with | 62.7 |
geoqa-a-geometric-question-answering | 34.1 |
inter-gps-interpretable-geometry-problem | 59.8 |
unigeo-unifying-geometry-logical-reasoning | 35.6 |
gaps-geometry-aware-problem-solver | 61.2 |
gold-geometry-problem-solver-with-natural | 65.8 |