HyperAI

CORD Information Extraction Dataset

Date

2 years ago

Size

1.91 GB

Organization

Publish URL

github.com

License

CC BY 4.0

特色图像

CORD stands for Consolidated Receipt Dataset for Post-OCR Parsing, which is a receipt dataset for Post-OCR parsing. The dataset contains thousands of Indonesian receipts (including images and box/text annotations for OCR, and multi-level semantic labels for parsing).

CORD.torrent
Seeding 1Downloading 1Completed 365Total Downloads 551
  • CORD/
    • .gitattributes
      1.7 KB
    • README.md
      2.64 KB
    • README.txt
      3.59 KB
      • data/
        • CORD.zip
          1.91 GB
    • dataset_infos.json
      1.91 GB