HyperAI超神経

Automated Theorem Proving on miniF2F-valid

Evaluation Metric

Pass@64
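
As a rough illustration of how a Pass@k figure such as Pass@64 is commonly computed, the sketch below counts a problem as solved when at least one of its first k proof attempts is verified, and reports the solved share as a percentage. The function name pass_at_k and the toy data are hypothetical and not taken from this page; the listed papers may use different attempt budgets or sampling protocols.

```python
from typing import Sequence


def pass_at_k(attempts_per_problem: Sequence[Sequence[bool]], k: int = 64) -> float:
    """Return the percentage of problems with at least one success in the first k attempts."""
    if not attempts_per_problem:
        raise ValueError("need at least one problem")
    # A problem counts as solved if any of its first k attempts was verified.
    solved = sum(1 for attempts in attempts_per_problem if any(attempts[:k]))
    return 100.0 * solved / len(attempts_per_problem)


# Hypothetical toy data: 4 problems, 3 proof attempts each (True = proof verified).
toy_results = [
    [False, True, False],
    [False, False, False],
    [True, True, False],
    [False, False, False],
]

print(pass_at_k(toy_results, k=3))  # 50.0
```

With these toy results, two of the four problems have a verified attempt, so the score is 50.0; a leaderboard entry would apply the same idea with k = 64 over the whole validation split.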

Evaluation Results

Performance of each model on this benchmark

Comparison Table
Model Name                                      Pass@64
hypertree-proof-search-for-neural-theorem       47.3
hypertree-proof-search-for-neural-theorem       46.7
minif2f-a-cross-system-benchmark-for-formal     -
minif2f-a-cross-system-benchmark-for-formal     -
hypertree-proof-search-for-neural-theorem       47.5
hypertree-proof-search-for-neural-theorem       58.6
minif2f-a-cross-system-benchmark-for-formal     -
draft-sketch-and-prove-guiding-formal-theorem   -
lyra-orchestrating-dual-correction-in           -
lego-prover-neural-theorem-proving-with         -