LSVTD Video Text Understanding Dataset
Date
3 years ago
Publish URL
License
其他
Categories

LSVTD stands for large-scale video text dataset, which contains 100 videos from 21 natural scenes. The dataset covers a wide range of 13 indoor (such as bookstores, shopping malls) and 9 outdoor scenes, and its diversity is more than 3 times that of the IC15 dataset.