icwb2-data Chinese Word Segmentation Dataset
The icwb2-data dataset is jointly released by Peking University, City University of Hong Kong, CKIP in Taiwan, Academia Sinica and Microsoft Research China, and is used to train Chinese word segmentation models. AS and CityU are traditional Chinese datasets, and PK and MSR are simplified Chinese datasets.
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.