VisualOverload Scene Image Understanding Dataset
Date
15 days ago
Size
601.3 MB
Publish URL
License
CC BY-SA 4.0
VisualOverload is a scene image understanding evaluation dataset that aims to examine the model's visual understanding and reasoning ability of details in complex scenes without relying on external knowledge.
This dataset contains 2,720 question-answer pairs, consisting of public-domain, high-resolution paintings that often feature multiple characters, actions, subplots, and complex backgrounds. The questions are manually designed to comprehensively test the model's scene understanding. This dataset is suitable for visual question answering research, detailed image understanding and reasoning, and evaluation of complex scenes with multiple characters and elements.

VisualOverload.torrent
Seeding 1Downloading 0Completed 1Total Downloads 11