Date

4 years ago

Size

1.86 GB

Organization

Publish URL

visualdialog.org

Paper URL

arxiv.org

License

CC BY 4.0

Tags

Multimodal

Deep Learning

Visual Question Answering

Image Understanding

VisDial, the full name of Visual Dialog, is a dataset containing manual annotation problems based on images from the MS COCO dataset. The dataset was developed by having two subjects chat about an image on Amazon Mechanical Turk. One of them acts as the questioner and the other acts as the answerer. The questioner can only see the text description of the image (i.e. the image caption from the MS COCO dataset), and the original image is not visible to the questioner. Their task is to ask questions around this image to "better imagine the scene". The answerer sees the image, the caption, and answers the questions asked by the questioner. The two of them can continue the conversation by asking and answering questions, up to 10 rounds. VisDial v1.0 includes:

Training set: 1,23,287 images, 10 rounds of dialogue per image;
Validation set: 2,064 images, 10 rounds of dialogue per image;
Test set: 8,000 images, 1 turn of dialogue per image.

VisDial.torrent

Seeding 2Downloading 0Completed 620Total Downloads 794

VisDial/
- README.md
  1.58 KB
- README.txt
  3.15 KB

This dataset is contributed by community users and is intended for educational and informational purposes only. If any content involves copyright infringement, please contact us at [email protected] for prompt review and removal.

Related Datasets

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

HyperAI

Use this Dataset

Discuss on Discord

Date

4 years ago

Size

1.86 GB

Organization

Publish URL

visualdialog.org

Paper URL

arxiv.org

License

CC BY 4.0

Related Datasets

COCO-2017-Vietnamese Vietnamese Image Detection Dataset

2 months ago

Vehicles OpenImages Vehicle Image Dataset

5 months ago

CCTV Incident Fall Detection Dataset

5 months ago

GroundingME Complex Scene Understanding Evaluation Dataset

8 days ago

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Command Palette

VisDial Image Dialogue Dataset

Build AI with AI

HyperAI Newsletters

Command Palette

VisDial Image Dialogue Dataset

Related Datasets

COCO-2017-Vietnamese Vietnamese Image Detection Dataset

Vehicles OpenImages Vehicle Image Dataset

CCTV Incident Fall Detection Dataset

GroundingME Complex Scene Understanding Evaluation Dataset

Build AI with AI

HyperAI Newsletters

Command Palette

VisDial Image Dialogue Dataset

Related Datasets

COCO-2017-Vietnamese Vietnamese Image Detection Dataset

Vehicles OpenImages Vehicle Image Dataset

CCTV Incident Fall Detection Dataset

GroundingME Complex Scene Understanding Evaluation Dataset

Build AI with AI

HyperAI Newsletters

Related Datasets

COCO-2017-Vietnamese Vietnamese Image Detection Dataset

Vehicles OpenImages Vehicle Image Dataset

CCTV Incident Fall Detection Dataset

GroundingME Complex Scene Understanding Evaluation Dataset

Related Datasets

COCO-2017-Vietnamese Vietnamese Image Detection Dataset

Vehicles OpenImages Vehicle Image Dataset

CCTV Incident Fall Detection Dataset

GroundingME Complex Scene Understanding Evaluation Dataset