Scaling Agents via Continual Pre-training

Abstract

Large language models (LLMs) have evolved into agentic systems capable of autonomous tool use and multi-step reasoning for complex problem-solving. However, post-training approaches built upon general-purpose foundation models consistently underperform on agentic tasks, particularly in open-source implementations. We identify the root cause: the absence of robust agentic foundation models forces models during post-training to simultaneously learn diverse agentic behaviors while aligning them to expert demonstrations, thereby creating fundamental optimization tensions. To this end, we are the first to propose incorporating Agentic Continual Pre-training (Agentic CPT) into the deep research agent training pipeline to build powerful agentic foundation models. Based on this approach, we develop a deep research agent model named AgentFounder. We evaluate our AgentFounder-30B on 10 benchmarks and achieve state-of-the-art performance while retaining strong tool-use ability, notably 39.9% on BrowseComp-en, 43.3% on BrowseComp-zh, and 31.5% Pass@1 on HLE.
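To make the two-stage idea concrete, below is a minimal PyTorch sketch of a pipeline that first continues pre-training a base language model on agentic data (the Agentic CPT stage) and then runs supervised post-training on expert demonstrations. The tiny GRU model, random stand-in corpora, and hyperparameters are illustrative assumptions for exposition only, not the paper's AgentFounder implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

# Toy causal language model standing in for an LLM backbone (assumption).
class TinyLM(nn.Module):
    def __init__(self, vocab_size=1000, dim=64):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, dim)
        self.rnn = nn.GRU(dim, dim, batch_first=True)
        self.head = nn.Linear(dim, vocab_size)

    def forward(self, tokens):
        h, _ = self.rnn(self.embed(tokens))
        return self.head(h)

def next_token_loss(model, tokens):
    # Standard next-token prediction: predict token t+1 from the prefix up to t.
    logits = model(tokens[:, :-1])
    return F.cross_entropy(logits.reshape(-1, logits.size(-1)),
                           tokens[:, 1:].reshape(-1))

def train_stage(model, batches, lr, steps):
    # One generic training stage; both CPT and post-training reuse it here.
    opt = torch.optim.AdamW(model.parameters(), lr=lr)
    for step in range(steps):
        tokens = batches[step % len(batches)]
        loss = next_token_loss(model, tokens)
        opt.zero_grad()
        loss.backward()
        opt.step()
    return model

# Stand-in corpora (random token ids). Real data would be tokenized agentic
# trajectories (plans, tool calls, observations) and expert demonstrations.
agentic_cpt_corpus = [torch.randint(0, 1000, (8, 128)) for _ in range(4)]
expert_demo_corpus = [torch.randint(0, 1000, (8, 128)) for _ in range(4)]

model = TinyLM()
# Stage 1: Agentic CPT -- continue pre-training on large-scale agentic data so
# the resulting foundation model already internalizes agentic behaviors.
model = train_stage(model, agentic_cpt_corpus, lr=1e-4, steps=20)
# Stage 2: post-training (e.g. SFT) on expert demonstrations, which now only
# needs to align behavior rather than teach it from scratch.
model = train_stage(model, expert_demo_corpus, lr=5e-5, steps=20)
```

In this sketch the separation of stages is the point: the base model absorbs agentic behaviors during continual pre-training, so the later alignment stage no longer has to resolve both objectives at once.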
