Search for a command to run...
AutoResearchBench: Benchmarking AI Agents on Complex Scientific Literature Discovery