HyperAI

TextOCR Text Recognition Dataset

Date

3 years ago

Organization

Publish URL

textvqa.org

License

CC BY 4.0


OCR stands for optical character recognition. TextOCR is a dataset for detecting and recognizing text in arbitrary scenes. It provides about 900,000 high-quality word annotations on images from TextVQA, which makes it possible to train and evaluate end-to-end models for downstream tasks such as visual question answering and image captioning.

The dataset includes:

  • 28,134 images from the TextVQA dataset
  • 903,096 annotated scene text words
  • About 32 annotated words per image on average
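
The per-image average follows directly from the two totals in the list. As a quick sanity check, the arithmetic can be sketched in Python as shown here; the constant and function names are illustrative only and are not part of any TextOCR tooling.

```python
# Totals quoted in the dataset summary above.
NUM_IMAGES = 28_134
NUM_WORDS = 903_096


def words_per_image(num_words: int, num_images: int) -> float:
    """Return the mean number of annotated words per image."""
    if num_images <= 0:
        raise ValueError("num_images must be positive")
    return num_words / num_images


average = words_per_image(NUM_WORDS, NUM_IMAGES)
print(f"{average:.1f} words per image")  # prints "32.1 words per image"
```

The result rounds to the 32 words per image quoted in the summary.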