THUCNews News Dataset
The THUCNews dataset is generated by filtering the historical data of Sina News from 2005 to 2011, and contains 740,000 news documents, all in UTF-8 plain text format. Based on the original Sina News classification system, this dataset is re-integrated into 14 candidate classification categories: finance, lottery, real estate, stocks, home, education, technology, society, fashion, current affairs, sports, constellations, games, and entertainment.
THUCNews.torrent
Seeding 3Downloading 1Completed 1,097Total Downloads 3,025