1 Billion Word Language Model Benchmark R13 Output Benchmark Corpus
Date
6 years ago
Size
1.67 GB
Publish URL
Categories
1 Billion Word Language Model Benchmark R13 Output is a new benchmark corpus used to measure and count progress in language modeling. With nearly 1 billion words of training data, the benchmark can quickly evaluate new language modeling techniques and combine them with other new technologies.
This dataset was released by Cornell University in 2013, and the main publishers are Ciprian Chelba, Tomas Mikolov, Mike Schuster, Qi Ge, Thorsten Brants, Phillipp Koehn and Tony Robinson.
1-billion-word-language-modeling-benchmark-r13output.torrent
Seeding 4Downloading 0Completed 867Total Downloads 1,532