Use this Dataset

Discuss on Discord

Date

3 years ago

Size

1.76 GB

Organization

Publish URL

ai.stanford.edu

Paper URL

License

Other

Tags

Image Understanding

Visual7W is a dataset for understanding image content. It performs visual question answering tasks by describing image regions in text and their associations. The dataset contains not only the image itself, but also questions and answers related to the content of the image region.

Visual7W is a subset of the Visual Genome dataset, containing 47,300 COCO dataset images, 327,929 question-answer pairs, 1,311,756 human-generated multiple-choice questions, and 561,459 object groundings covering 36,579 categories.

Visual7W's questions are mainly composed of What, Where, How, When, Who, Why, and Which. The questions are multiple-choice, and each question has four candidate answers.

Visual7W.torrent

Seeding 1Downloading 0Completed 584Total Downloads 738

Visual7W/
- README.md
  1.34 KB
- README.txt
  2.68 KB

This dataset is contributed by community users and is intended for educational and informational purposes only. If any content involves copyright infringement, please contact us at [email protected] for prompt review and removal.

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Powered by MailChimp

Use this Dataset

Discuss on Discord

Date

3 years ago

Size

1.76 GB

Organization

Publish URL

ai.stanford.edu

Paper URL

License

Other

Tags

Image Understanding

Visual7W is a dataset for understanding image content. It performs visual question answering tasks by describing image regions in text and their associations. The dataset contains not only the image itself, but also questions and answers related to the content of the image region.

Visual7W is a subset of the Visual Genome dataset, containing 47,300 COCO dataset images, 327,929 question-answer pairs, 1,311,756 human-generated multiple-choice questions, and 561,459 object groundings covering 36,579 categories.

Visual7W's questions are mainly composed of What, Where, How, When, Who, Why, and Which. The questions are multiple-choice, and each question has four candidate answers.

Visual7W.torrent

Seeding 1Downloading 0Completed 584Total Downloads 738

Visual7W/
- README.md
  1.34 KB
- README.txt
  2.68 KB

This dataset is contributed by community users and is intended for educational and informational purposes only. If any content involves copyright infringement, please contact us at [email protected] for prompt review and removal.

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Powered by MailChimp