Automated Theorem Proving on miniF2F-valid
Metrics
Pass@64
Results
Performance results of various models on this benchmark
Comparison table
Model name | Pass@64 (%) |
---|---|
hypertree-proof-search-for-neural-theorem | 47.3 |
hypertree-proof-search-for-neural-theorem | 46.7 |
minif2f-a-cross-system-benchmark-for-formal | - |
minif2f-a-cross-system-benchmark-for-formal | - |
hypertree-proof-search-for-neural-theorem | 47.5 |
hypertree-proof-search-for-neural-theorem | 58.6 |
minif2f-a-cross-system-benchmark-for-formal | - |
draft-sketch-and-prove-guiding-formal-theorem | - |
lyra-orchestrating-dual-correction-in | - |
lego-prover-neural-theorem-proving-with | - |