HyperAI

SAOKE Manually Annotated Dataset

Date

3 years ago

Size

30.55 MB

Organization

Baidu

Publish URL

ai.baidu.com

License

其他

Categories

特色图像

SAOKE stands for Symbol Aided Open Knowledge Expression. It is a manually annotated dataset containing more than 40,000 Chinese sentences and corresponding facts in SAOKE form. It is the largest publicly available manually annotated dataset for open domain information extraction tasks.

This dataset has the following advantages:

  • The data is authentic and open to use: following the OIE system concept, using original sentences to express knowledge
  • Compatible with all types of knowledge: Provides a unified view of four types of knowledge (relationships, attributes, descriptions, and concepts)
  • Accurate expression: Ability to accurately express facts using discrete relational phrases, missing information, hidden information, etc.
SAOKE.torrent
Seeding 1Downloading 1Completed 354Total Downloads 454
  • SAOKE/
    • README.md
      1.26 KB
    • README.txt
      2.52 KB
      • data/
        • SAOKE_DATA.json
          30.55 MB