Automated Theorem Proving on Metamath set.mm
Evaluation Metrics
Percentage correct
Evaluation Results
Performance of each model on this benchmark
Comparison Table
Model Name | Percentage correct |
---|---|
generative-language-modeling-for-automated | 56.2 |
hypertree-proof-search-for-neural-theorem | - |
learning-to-prove-theorems-by-learning-to-1 | 22.1 |
holophrasm-a-neural-automated-theorem-prover | 14.3 |