Automated Theorem Proving On Holist Benchmark
评估指标
Percentage correct
评测结果
各个模型在此基准测试上的表现结果
比较表格
模型名称 | Percentage correct |
---|---|
holist-an-environment-for-machine-learning-of | 32.65 |
learning-to-reason-in-large-theories-without | 36.55 |
holist-an-environment-for-machine-learning-of | 38.88 |
graph-representations-for-higher-order-logic | 49.95 |