Command Palette
Search for a command to run...
Automated Theorem Proving On Minif2F 1
평가 지표
Pass@64
평가 결과
이 벤치마크에서 각 모델의 성능 결과
| Paper Title | ||
|---|---|---|
| Evariste-7d | 42.5 | HyperTree Proof Search for Neural Theorem Proving |
| Evariste-1d | 33.6 | HyperTree Proof Search for Neural Theorem Proving |
| Evariste | 32.1 | HyperTree Proof Search for Neural Theorem Proving |
| GPT-f | 30.6 | HyperTree Proof Search for Neural Theorem Proving |
0 of 4 row(s) selected.