HyperAIHyperAI

Command Palette

Search for a command to run...

UserBench Benchmark

Date

10 days ago

Organization

Paper URL

2507.22034

UserBench was jointly proposed in July 2025 by the Salesforce AI Research team and a research team from the University of Illinois at Urbana-Champaign. The related research results were published in the paper "...".UserBench: An Interactive Gym Environment for User-Centric Agents".

UserBench is a user-centric benchmark designed to evaluate the performance of agents in multi-turn, preference-driven interactions. In UserBench, simulated users provide initial, vague task instructions, gradually revealing preferences over time, often implicitly. Agents must proactively clarify their goals, interpret subtle cues, and succeed through adaptive reasoning tools. Built on the standard Gymnasium framework, UserBench offers a modular, scalable setup with standardized interaction interfaces and a stable backend for tool usage, enabling rigorous and repeatable evaluation.

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
UserBench Benchmark | Wiki | HyperAI