TaskCraft: Automated Generation of Agentic Tasks

Agentic tasks, which require multi-step problem solving with autonomy, tooluse, and adaptive reasoning, are becoming increasingly central to theadvancement of NLP and AI. However, existing instruction data lacks toolinteraction, and current agentic benchmarks rely on costly human annotation,limiting their scalability. We introduce \textsc{TaskCraft}, an automatedworkflow for generating difficulty-scalable, multi-tool, and verifiable agentictasks with execution trajectories. TaskCraft expands atomic tasks usingdepth-based and width-based extensions to create structurally andhierarchically complex challenges. Empirical results show that these tasksimprove prompt optimization in the generation workflow and enhance supervisedfine-tuning of agentic foundation models. We present a large-scale syntheticdataset of approximately 36,000 tasks with varying difficulty to support futureresearch on agent tuning and evaluation.