icwb2-data Chinese Word Segmentation Dataset
Date
2 years ago
Size
50.2 MB
Publish URL
The icwb2-data dataset was jointly released by Peking University, City University of Hong Kong, CKIP (Academia Sinica, Taiwan), and Microsoft Research China for the Second International Chinese Word Segmentation Bakeoff (SIGHAN 2005). It is used to train and evaluate Chinese word segmentation models. The AS and CityU corpora are in Traditional Chinese; the PK and MSR corpora are in Simplified Chinese.
icwb2-data.torrent
Seeding: 1; Downloading: 0; Completed: 1,084; Total Downloads: 2,301
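
Since the corpora are meant for training segmentation models, a common first step is to turn segmented sentences into per-character labels. The sketch below assumes each training line is a sentence whose words are separated by whitespace; that layout is an assumption to check against the files in the archive. The function name `to_bmes` and the sample sentence are illustrative and are not part of the dataset. It uses the widely used B/M/E/S scheme (begin, middle, end, single-character word).

```python
def to_bmes(segmented_line):
    """Convert one whitespace-segmented sentence into (character, tag) pairs.

    Assumes words are separated by whitespace (an assumption about the
    training files, not something this page states).
    """
    pairs = []
    for word in segmented_line.split():
        if len(word) == 1:
            # A one-character word is tagged S (single).
            pairs.append((word, "S"))
        else:
            # Longer words: B for the first character, M for any
            # interior characters, E for the last character.
            pairs.append((word[0], "B"))
            pairs.extend((ch, "M") for ch in word[1:-1])
            pairs.append((word[-1], "E"))
    return pairs


if __name__ == "__main__":
    # Illustrative sentence, not taken from the corpus.
    sample = "我们  喜欢  自然语言处理  。"
    for ch, tag in to_bmes(sample):
        print(ch, tag)
```

The resulting character and tag pairs can be fed to a sequence labeling model; other tag schemes (for example, a two-tag B/I scheme) would work with a similar loop.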