Gutenberg Dataset (E-book Dataset)
Date
2 years ago
The Gutenberg dataset contains 3,036 English-language books by 142 authors. It is a small subset of the Project Gutenberg corpus and is used mainly for language modeling.
This dataset was released by Mycroft AI in April 2014. The main publisher was Matthew D. Scholefield. The related paper is "Complexity of Word Collocation Networks: A Preliminary Structural Analysis".