HyperAI

1 Billion Word Language Model Benchmark R13 Output Benchmark Corpus

Date

6 years ago

Size

1.67 GB

Organization

Cornell University

Publish URL

www.statmt.org

1 Billion Word Language Model Benchmark R13 Output is a benchmark corpus for measuring progress in statistical language modeling. With nearly 1 billion words of training data, it is intended for quickly evaluating new language modeling techniques and for comparing their contribution when combined with other advanced techniques.

This dataset was released by Cornell University in 2013. Its main authors are Ciprian Chelba, Tomas Mikolov, Mike Schuster, Qi Ge, Thorsten Brants, Philipp Koehn, and Tony Robinson.

1-billion-word-language-modeling-benchmark-r13output.torrent
Seeding: 4 | Downloading: 0 | Completed: 867 | Total Downloads: 1,532
  • 1-billion-word-language-modeling-benchmark-r13output/
    • README.md
      1.18 KB
    • README.txt
      2.36 KB
    • data/
      • 1-billion-word-language-modeling-benchmark-r13output.tar.gz
        1.67 GB
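
Since the corpus ships as a single `.tar.gz` archive, a quick sanity check after download is to stream through it and count lines and tokens per file without unpacking everything to disk. The following is a minimal sketch, not an official loader: the function name `count_tokens_per_file` is hypothetical, and it assumes the files inside the archive are plain text with one sentence per line and whitespace-separated tokens.

```python
import tarfile
from typing import Iterator, Tuple


def count_tokens_per_file(archive_path: str) -> Iterator[Tuple[str, int, int]]:
    """Yield (member_name, line_count, token_count) for each regular file.

    Assumption: members are plain-text files whose tokens are separated
    by whitespace. Directories and other non-file members are skipped.
    """
    # "r:gz" opens a gzip-compressed tar archive for reading.
    with tarfile.open(archive_path, mode="r:gz") as archive:
        for member in archive:
            if not member.isfile():
                continue
            handle = archive.extractfile(member)
            if handle is None:
                continue
            lines = 0
            tokens = 0
            # Read line by line so a large member is never fully in memory.
            for raw_line in handle:
                lines += 1
                tokens += len(raw_line.split())
            yield member.name, lines, tokens
```

Calling `count_tokens_per_file("data/1-billion-word-language-modeling-benchmark-r13output.tar.gz")` and summing the third field of each tuple gives a rough total token count for the whole archive, under the whitespace-tokenization assumption above.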