Visual Genome Densely Annotated Dataset
Date
2 years ago
Size
15.31 GB
Publish URL
License
CC BY 4.0
Categories

The Visual Genome Dataset is a dataset that connects language and vision through crowdsourced dense image annotation, including Visual Question Answering data in a multiple-choice environment.
The dataset consists of 1.7 million QA pairs for 101,174 MSCOCO images, with an average of 17 questions per image.
Compared to the Visual Question Answering dataset, the Visual Genome dataset has a more balanced distribution of six types of questions: What, Where, When, Who, Why, and How. In addition, Visual Genome also displays 108,000 images densely annotated with targets, attributes, and relationships.
Visual_Genome_Dataset.torrent
Seeding 2Downloading 1Completed 591Total Downloads 865