The Illusion of Diminishing Returns: Measuring Long Horizon Execution in LLMs