HyperAI

HowToVQA69M Video Question Answering Dataset

Date

3 years ago

Size

7.88 GB

Organization

License

Other

VQA stands for visual question answering. HowToVQA69M is a video question answering dataset containing 69,270,581 question-answer pairs. It is roughly two orders of magnitude larger than previously available video question answering (VideoQA) datasets.

On average, each source video yields 43 video clips; each clip lasts 12.1 seconds and is associated with 1.2 question-answer pairs. Questions contain 8.7 words and answers contain 2.4 words on average. The HowToVQA69M dataset is highly diverse: it contains more than 16 million unique answers, of which more than 2 million appear more than once and more than 300,000 appear more than 10 times.
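
The diversity figures above (unique answers, answers seen more than once, answers seen more than 10 times) are frequency counts over the answer strings. A minimal sketch of how such counts could be computed is shown below. The function name `answer_diversity`, the case and whitespace normalization, and the toy answers are assumptions made for illustration; this page does not describe how the published figures were produced.

```python
from collections import Counter
from typing import Iterable


def answer_diversity(answers: Iterable[str]) -> dict:
    """Summarize how often each distinct answer occurs."""
    # Normalizing case and whitespace is an assumption of this sketch,
    # not something the dataset description specifies.
    counts = Counter(a.strip().lower() for a in answers)
    return {
        "unique": len(counts),
        "more_than_once": sum(1 for c in counts.values() if c > 1),
        "more_than_10_times": sum(1 for c in counts.values() if c > 10),
    }


# Toy data for illustration only; these are not entries from HowToVQA69M.
toy_answers = ["butter"] * 12 + ["knife", "knife", "wood glue"]
print(answer_diversity(toy_answers))
# → {'unique': 3, 'more_than_once': 2, 'more_than_10_times': 1}
```

On the full dataset the answers would need to be streamed rather than held in a list, since the files are several gigabytes each.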

HowToVQA69M.torrent
Seeding: 1 | Downloading: 1 | Completed: 476 | Total downloads: 407
  • HowToVQA69M/
    • README.md
      1.23 KB
    • README.txt
      2.47 KB
    • data/
      • ReadMe.txt
        3.38 KB
      • howtovqa.pkl
        5.98 GB
      • train_howtovqa.csv
        6.02 GB
      • val_howtovqa.csv
        6.02 GB
      • vedio/
        • HowTo100M.zip
          7.88 GB