HyperAI

Visual7W Visual Question Answering Dataset

Date

3 years ago

Size

1.76 GB

Organization

Stanford University

Publish URL

ai.stanford.edu

License

其他

特色图像

Visual7W is a dataset for understanding image content. It performs visual question answering tasks by describing image regions in text and their associations. The dataset contains not only the image itself, but also questions and answers related to the content of the image region.

Visual7W is a subset of the Visual Genome dataset, containing 47,300 COCO dataset images, 327,929 question-answer pairs, 1,311,756 human-generated multiple-choice questions, and 561,459 object groundings covering 36,579 categories.

Visual7W's questions are mainly composed of What, Where, How, When, Who, Why, and Which. The questions are multiple-choice, and each question has four candidate answers.

Visual7W.torrent
Seeding 1Downloading 1Completed 397Total Downloads 510
  • Visual7W/
    • README.md
      1.34 KB
    • README.txt
      2.68 KB
      • data/
        • dataset_v7w_grounding_annotations.zip
          7.07 MB
        • dataset_v7w_pointing.zip
          18.56 MB
        • dataset_v7w_telling.zip
          24.2 MB
        • visual7w-toolkit
          24.39 MB
        • visual7w_images.zip
          1.76 GB