HyperAI

How2R Video Retrieval Dataset

Date

3 years ago

Organization

Microsoft Dynamics 365 AI Research

Publish URL

github.com

License

Other


How2R is a dataset for text-based video retrieval. It contains 24,328 60-second clips and 51,390 associated text queries collected from 9,371 videos in the HowTo100M dataset, with an average of 2-3 queries per clip. The data is split 80% for training, 10% for validation, and 10% for testing.
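As a rough illustration of the figures above, the following sketch computes the per-clip query average and the approximate size of each split. It assumes the 80/10/10 split is applied at the clip level with remainders going to the test set; the page does not say whether the official split is made by clip, query, or source video, so the exact official counts may differ. The helper name `split_sizes` is hypothetical.

```python
# Figures quoted in the dataset description.
NUM_CLIPS = 24_328
NUM_QUERIES = 51_390
NUM_VIDEOS = 9_371


def split_sizes(total):
    """Return (train, val, test) counts for an 80/10/10 split.

    Assumption: integer floor for train and val, remainder goes to test.
    The official split procedure is not described on the page.
    """
    train = total * 8 // 10
    val = total // 10
    test = total - train - val
    return train, val, test


avg_queries_per_clip = NUM_QUERIES / NUM_CLIPS
avg_clips_per_video = NUM_CLIPS / NUM_VIDEOS
clip_split = split_sizes(NUM_CLIPS)

print(f"queries per clip: {avg_queries_per_clip:.2f}")
print(f"clips per video:  {avg_clips_per_video:.2f}")
print(f"train/val/test clips (approx.): {clip_split}")
```

Under these assumptions the split comes out to roughly 19.5k training clips and about 2.4k clips each for validation and testing.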

How2R and How2QA are new, challenging benchmarks for research on video retrieval and video question answering.