HyperAI초신경

Automated Theorem Proving On Holist Benchmark

평가 지표

Percentage correct

평가 결과

이 벤치마크에서 각 모델의 성능 결과

비교 표
모델 이름Percentage correct
holist-an-environment-for-machine-learning-of32.65
learning-to-reason-in-large-theories-without36.55
holist-an-environment-for-machine-learning-of38.88
graph-representations-for-higher-order-logic49.95