HowToVQA69M Video Question Answering Dataset

Date: 3 years ago
Size: 7.88 GB
Organization:
Paper URL: arxiv.org
License: Other

VQA stands for visual question answering. HowToVQA69M is a video question answering (VideoQA) dataset containing 69,270,581 question-answer pairs, each tied to a video clip. It is two orders of magnitude larger than previously available VideoQA datasets.

On average, each source video yields 43 video clips. Each clip lasts 12.1 seconds and is associated with 1.2 question-answer pairs; questions average 8.7 words and answers average 2.4 words. The dataset is highly diverse: it contains more than 16 million unique answers, of which more than 2 million appear more than once and more than 300,000 appear more than 10 times.
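The answer-diversity figures above (unique answers, answers seen more than once, answers seen more than 10 times) can be recomputed from any collection of question-answer pairs. The following is a minimal sketch, not the authors' code: the function name `answer_diversity`, the `(question, answer)` tuple format, and the lowercase/strip normalization are all assumptions made for illustration, so the counts it produces may differ from the published ones.

```python
from collections import Counter
from typing import Dict, Iterable, Tuple


def answer_diversity(pairs: Iterable[Tuple[str, str]]) -> Dict[str, int]:
    """Count unique answers and how many of them repeat.

    `pairs` is a hypothetical iterable of (question, answer) strings.
    Answers are lowercased and stripped before counting (an assumption).
    """
    counts = Counter(answer.strip().lower() for _, answer in pairs)
    return {
        "unique_answers": len(counts),
        "seen_more_than_once": sum(1 for c in counts.values() if c > 1),
        "seen_more_than_10_times": sum(1 for c in counts.values() if c > 10),
    }


# Toy data for illustration only; not taken from the dataset.
toy_pairs = [
    ("what is being cut?", "onion"),
    ("what tool is used?", "knife"),
    ("what is chopped?", "Onion"),
]
print(answer_diversity(toy_pairs))
# {'unique_answers': 2, 'seen_more_than_once': 1, 'seen_more_than_10_times': 0}
```

Because the function accepts any iterable, it can be fed a generator that streams pairs from disk rather than a list held in memory.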

HowToVQA69M.torrent
Seeding 2 | Downloading 0 | Completed 652 | Total Downloads 554
  • HowToVQA69M/
    • README.md (1.23 KB)
    • README.txt (2.47 KB)
    • data/
      • ReadMe.txt (3.38 KB)
      • howtovqa.pkl (5.98 GB)
      • train_howtovqa.csv (6.02 GB)
      • val_howtovqa.csv (6.02 GB)
      • vedio/
        • HowTo100M.zip (7.88 GB)
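
The CSV files in the listing are several gigabytes each, so it can help to inspect the header and a few rows before loading anything in full. The sketch below is one way to do that with the Python standard library. The helper name `peek_csv` is hypothetical, and the column layout of the files is not documented on this page, so the code makes no assumption about column names; it also assumes the files are UTF-8 encoded and comma-separated.

```python
import csv
from itertools import islice
from typing import List, Tuple


def peek_csv(path: str, n_rows: int = 5) -> Tuple[List[str], List[List[str]]]:
    """Return the header and the first `n_rows` data rows of a CSV file.

    Only the requested rows are read, so a multi-gigabyte file is never
    loaded into memory. UTF-8 encoding is an assumption.
    """
    with open(path, newline="", encoding="utf-8") as f:
        reader = csv.reader(f)
        header = next(reader, [])  # empty list if the file has no lines
        rows = list(islice(reader, n_rows))
    return header, rows
```

For example, `header, rows = peek_csv("train_howtovqa.csv", n_rows=3)` would return the column names and three sample rows, where the path is wherever the torrent contents were extracted.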
