HyperAIHyperAI

Command Palette

Search for a command to run...

COREVQA Visual Question Answering Benchmark Dataset

Date

3 months ago

Size

5.63 GB

Publish URL

www.kaggle.com

Paper URL

2507.13405

License

Apache 2.0

COREVQA is a visual question answering benchmark dataset released by the Algoverse Artificial Intelligence Research Center in 2025. The related paper results are "COREVQA: A Crowd Observation and Reasoning Entailment Visual Question Answering Benchmark", which aims to evaluate the reasoning entailment ability of visual language models (VLMs) in crowd scenes.

This dataset contains 5,608 pairs of images and true/false sentences. The images are derived from the CrowdHuman dataset. The data primarily depicts real-world crowded scenes, emphasizing challenges such as occlusion, perspective changes, and background interference. It aims to advance the fine-grained perception and reasoning capabilities of VLMs in complex social scenarios.

The data includes:

  • Scene image (image_id)
  • Natural language statement (question)
  • Binary label (answer:TRUE / FALSE)

COREVQA.torrent
Seeding 1Downloading 0Completed 14Total Downloads 67
  • COREVQA/
    • README.md
      1.42 KB
    • README.txt
      2.85 KB
      • data/
        • COREVQA.zip
          5.63 GB

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp