HyperAIHyperAI

Command Palette

Search for a command to run...

VisDial Image Dialogue Dataset

Date

3 years ago

Size

1.86 GB

Organization

Publish URL

visualdialog.org

Paper URL

arxiv.org

License

CC BY 4.0

Featured Image

VisDial, the full name of Visual Dialog, is a dataset containing manual annotation problems based on images from the MS COCO dataset.

The dataset was developed by having two subjects chat about an image on Amazon Mechanical Turk. One of them acts as the questioner and the other acts as the answerer. The questioner can only see the text description of the image (i.e. the image caption from the MS COCO dataset), and the original image is not visible to the questioner. Their task is to ask questions around this image to "better imagine the scene". The answerer sees the image, the caption, and answers the questions asked by the questioner. The two of them can continue the conversation by asking and answering questions, up to 10 rounds.

VisDial v1.0 includes:

  • Training set: 1,23,287 images, 10 rounds of dialogue per image;
  • Validation set: 2,064 images, 10 rounds of dialogue per image;
  • Test set: 8,000 images, 1 turn of dialogue per image.
VisDial.torrent
Seeding 2Downloading 0Completed 582Total Downloads 696
  • VisDial/
    • README.md
      1.58 KB
    • README.txt
      3.15 KB
      • data/
        • VisualDialog_test2018.zip
          1.2 GB
        • VisualDialog_val2018.zip
          1.51 GB
        • visdial_1.0_test.zip
          1.51 GB
        • visdial_1.0_train.zip
          1.85 GB
        • visdial_1.0_val.zip
          1.86 GB

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp