VizWiz Visual Question Answering Dataset for the Blind
Date
3 years ago
Size
17.65 GB
Publish URL
Paper URL
License
CC BY 4.0

VizWiz-VQA (Visual Question Answering) is an image dataset for visual question answering for the blind. Blind users use the VizWiz software to take a photo and record a verbal question about the photo and 10 crowdsourced answers to the question. This dataset is used to solve the following two problems: one is to predict the answer to a visual question, and the other is to determine whether a visual question can be answered. This dataset aims to study more general algorithms to help blind people solve life obstacles.
The dataset includes (2020 latest version):
- 20,523 pairs of training images/questions
- 205,230 for training answers/answer confidence
- 4319 Verification images/questions
- 43,190 Verification answers/answer confidence
- 8,000 pairs of test images/questions
VisWiz.torrent
Seeding 1Downloading 0Completed 231Total Downloads 383
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.
AI Co-coding
Ready-to-use GPUs
Best Pricing
Hyper Newsletters
Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp