HyperAI

LSVTD Video Text Understanding Dataset

Date

3 years ago

Organization

Zhejiang University

License

其他

Download Help
特色图像

LSVTD stands for large-scale video text dataset, which contains 100 videos from 21 natural scenes. The dataset covers a wide range of 13 indoor (such as bookstores, shopping malls) and 9 outdoor scenes, and its diversity is more than 3 times that of the IC15 dataset.