HyperAI

Mathematical Reasoning On Frontiermath

Metrics

Accuracy

Results

Performance results of various models on this benchmark

Comparison Table
Model NameAccuracy
Model 10.01
Model 20.01
Model 30.01
Model 40.01
Model 50.252
frontiermath-a-benchmark-for-evaluating0.02