Search for a command to run...
Agent Explorative Policy Optimization for Multimodal Agentic Reasoning