Mathematical Reasoning On Frontiermath
Metrics
Accuracy
Results
Performance results of various models on this benchmark
Comparison Table
Model Name | Accuracy |
---|---|
Model 1 | 0.01 |
Model 2 | 0.01 |
Model 3 | 0.01 |
Model 4 | 0.01 |
Model 5 | 0.252 |
frontiermath-a-benchmark-for-evaluating | 0.02 |