HyperAI

Automated Theorem Proving on miniF2F-valid

Metrics

Pass@64
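
The page does not define Pass@64. A common reading is that a theorem counts as solved if at least one of 64 proof attempts succeeds, and the score is the percentage of theorems solved. When more than k attempts are sampled per theorem, the widely used unbiased pass@k estimator, 1 - C(n-c, k) / C(n, k), can be applied instead. The sketch below assumes that reading; the function names and the (n, c) input layout are illustrative and do not come from this leaderboard.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k estimate for one theorem.

    n: number of proof attempts sampled (assumed input)
    c: number of those attempts that produced a valid proof
    k: attempt budget being scored, e.g. 64
    """
    if not (0 <= c <= n) or not (1 <= k <= n):
        raise ValueError("require 0 <= c <= n and 1 <= k <= n")
    # Probability that a random size-k subset of the n attempts has no success.
    return 1.0 - comb(n - c, k) / comb(n, k)


def benchmark_pass_at_k(results: list[tuple[int, int]], k: int) -> float:
    """Average pass@k over all theorems, as a percentage.

    results: one (n, c) pair per theorem (hypothetical layout).
    """
    if not results:
        raise ValueError("results must not be empty")
    return 100.0 * sum(pass_at_k(n, c, k) for n, c in results) / len(results)
```

With exactly 64 attempts per theorem (n = k = 64), the estimate for a single theorem is 1 if any attempt succeeded and 0 otherwise, so the benchmark score reduces to the plain percentage of theorems solved.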

Results

Performance results of various models on this benchmark

Comparison table
Model name                                       Pass@64
hypertree-proof-search-for-neural-theorem        47.3
hypertree-proof-search-for-neural-theorem        46.7
minif2f-a-cross-system-benchmark-for-formal      -
minif2f-a-cross-system-benchmark-for-formal      -
hypertree-proof-search-for-neural-theorem        47.5
hypertree-proof-search-for-neural-theorem        58.6
minif2f-a-cross-system-benchmark-for-formal      -
draft-sketch-and-prove-guiding-formal-theorem    -
lyra-orchestrating-dual-correction-in            -
lego-prover-neural-theorem-proving-with          -