HyperAI

20 Newsgroups Newsgroup Document Dataset

Date

2 years ago

Size

44.31 MB

Publish URL

qwone.com

License

非商业用途

20 Newsgroups is a dataset consisting of approximately 20,000 news documents and has become a popular dataset for experiments on text applications in machine learning.

The dataset is evenly distributed among 20 different newsgroups and is one of the international standard datasets used for text classification, text mining, and information retrieval research.

The 20 Newsgroups dataset was published by Ken Lang in the Proceedings of the 12th International Conference on Machine Learning in 1995. The related paper is Newsweeder: Learning to filter netnews.

20 Newsgroups.torrent
Seeding 2Downloading 0Completed 820Total Downloads 1,644
  • 20 Newsgroups/
    • README.md
      1.19 KB
    • README.txt
      2.38 KB
      • data/
        • 20news-18828.tar.gz
          13.99 MB
        • 20news-19997.tar.gz
          30.52 MB
        • 20news-bydate.tar.gz
          44.31 MB