3 months ago

Mingyi Deng Lijun Huang Yani Fan Jiayi Zhang Fashen Ren Jinyi Bai Fuzhen Yang Dayi Miao Zhaoyang Yu Yifan Wu

Abstract

Language agents have demonstrated remarkable potential in web search and information retrieval. However, these search agents assume user queries are complete and unambiguous, an assumption that diverges from reality where users begin with incomplete queries requiring clarification through interaction. Yet most agents lack interactive mechanisms during the search process, and existing benchmarks cannot assess this capability. To address this gap, we introduce InteractComp, a benchmark designed to evaluate whether search agents can recognize query ambiguity and actively interact to resolve it during search. Following the principle of easy to verify, interact to disambiguate, we construct 210 expert-curated questions across 9 domains through a target-distractor methodology that creates genuine ambiguity resolvable only through interaction. Evaluation of 17 models reveals striking failure: the best model achieves only 13.73% accuracy despite 71.50% with complete context, exposing systematic overconfidence rather than reasoning deficits. Forced interaction produces dramatic gains, demonstrating latent capability current strategies fail to engage. Longitudinal analysis shows interaction capabilities stagnated over 15 months while search performance improved seven-fold, revealing a critical blind spot. This stagnation, coupled with the immediate feedback inherent to search tasks, makes InteractComp a valuable resource for both evaluating and training interaction capabilities in search agents. The code is available at https://github.com/FoundationAgents/InteractComp.

Source PDF

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

3 months ago

Mingyi Deng Lijun Huang Yani Fan Jiayi Zhang Fashen Ren Jinyi Bai Fuzhen Yang Dayi Miao Zhaoyang Yu Yifan Wu

Abstract

Source PDF

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

3 months ago

Mingyi Deng Lijun Huang Yani Fan Jiayi Zhang Fashen Ren Jinyi Bai Fuzhen Yang Dayi Miao Zhaoyang Yu Yifan Wu

Abstract

Source PDF

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Command Palette

InteractComp: Evaluating Search Agents With Ambiguous Queries

Mingyi Deng Lijun Huang Yani Fan Jiayi Zhang Fashen Ren Jinyi Bai Fuzhen Yang Dayi Miao Zhaoyang Yu Yifan Wu15 more

Abstract

Build AI with AI

HyperAI Newsletters

Command Palette

InteractComp: Evaluating Search Agents With Ambiguous Queries

Mingyi Deng Lijun Huang Yani Fan Jiayi Zhang Fashen Ren Jinyi Bai Fuzhen Yang Dayi Miao Zhaoyang Yu Yifan Wu15 more

Abstract

Build AI with AI

HyperAI Newsletters

Command Palette

InteractComp: Evaluating Search Agents With Ambiguous Queries

Mingyi Deng Lijun Huang Yani Fan Jiayi Zhang Fashen Ren Jinyi Bai Fuzhen Yang Dayi Miao Zhaoyang Yu Yifan Wu15 more

Abstract

Build AI with AI

HyperAI Newsletters

Mingyi Deng Lijun Huang Yani Fan Jiayi Zhang Fashen Ren Jinyi Bai Fuzhen Yang Dayi Miao Zhaoyang Yu Yifan Wu

Mingyi Deng Lijun Huang Yani Fan Jiayi Zhang Fashen Ren Jinyi Bai Fuzhen Yang Dayi Miao Zhaoyang Yu Yifan Wu

Mingyi Deng Lijun Huang Yani Fan Jiayi Zhang Fashen Ren Jinyi Bai Fuzhen Yang Dayi Miao Zhaoyang Yu Yifan Wu