PhysToolBench Physics Tool Task Dataset

PhysToolBench is a visual question answering (VQA) dataset released in 2025 by the Hong Kong University of Science and Technology (Guangzhou) in collaboration with the Hong Kong University of Science and Technology, Beijing University of Aeronautics and Astronautics, and other institutions. The related research paper is titled "PhysToolBench: Benchmarking Physical Tool Understanding for MLLMs". The study aims to evaluate the ability of multimodal large language models (MLLMs) to identify, understand, and create physical tools.

This dataset contains over 1,000 image-text pairs, covering various scenarios including daily life, industry, outdoor activities, and professional environments. It is divided into three levels of difficulty: easy, medium, and hard. The task structure is as follows:

  • Tool Creation
  • Tool Recognition
  • Tool Understanding

Figure: Dataset example
