Automated Theorem Proving On Minif2F 1
평가 지표
Pass@64
평가 결과
이 벤치마크에서 각 모델의 성능 결과
모델 이름 | Pass@64 | Paper Title | Repository |
---|---|---|---|
Evariste | 32.1 | HyperTree Proof Search for Neural Theorem Proving | - |
Evariste-1d | 33.6 | HyperTree Proof Search for Neural Theorem Proving | - |
GPT-f | 30.6 | HyperTree Proof Search for Neural Theorem Proving | - |
Evariste-7d | 42.5 | HyperTree Proof Search for Neural Theorem Proving | - |
0 of 4 row(s) selected.