Search for a command to run...
Insights from Benchmarking Frontier Language Models on Web App Code Generation