Search for a command to run...
FrontierMath: A Benchmark for Evaluating Advanced Mathematical Reasoning in AI