HyperAI

LJ Speech Dataset

This is a public domain speech dataset containing 13,100 short audio clips of a single speaker reading passages from 7 non-fiction books. Transcriptions are provided for each clip. The clips range in length from 1 to 10 seconds, with a total length of approximately 24 hours.

The texts were published between 1884 and 1964 and are in the public domain. The audio was recorded in 2016-17 by the LibriVox project and is also in the public domain.

LJ-Speech-Dataset.torrent
Seeding: 1 | Downloading: 1 | Completed: 61 | Total Downloads: 120
  • LJ-Speech-Dataset/
    • README.md (1.12 KB)
    • README.txt (2.23 KB)
    • data/
      • LJSpeech-1.1.tar.bz2 (2.56 GB)
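Once the archive is downloaded, the clips and transcriptions can be unpacked and read with the Python standard library. The following is a minimal sketch, not an official loader. It assumes the archive contains a pipe-delimited `metadata.csv` with one row per clip (clip ID, transcription, normalized transcription), which this page does not state. The function names `extract_archive` and `read_metadata` are hypothetical helpers introduced for illustration.

```python
import csv
import tarfile
from pathlib import Path


def extract_archive(archive_path, dest_dir):
    """Unpack a .tar.bz2 archive into dest_dir and return the destination path."""
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)
    with tarfile.open(archive_path, mode="r:bz2") as tar:
        tar.extractall(dest)
    return dest


def read_metadata(metadata_path):
    """Parse a pipe-delimited metadata file into a list of dicts.

    Assumed row layout (not confirmed by this page):
    clip ID | transcription | normalized transcription
    """
    rows = []
    with open(metadata_path, encoding="utf-8", newline="") as handle:
        # QUOTE_NONE keeps quotation marks in the transcriptions as literal text.
        reader = csv.reader(handle, delimiter="|", quoting=csv.QUOTE_NONE)
        for fields in reader:
            if len(fields) < 2:
                continue  # skip blank or malformed rows
            normalized = fields[2] if len(fields) > 2 else fields[1]
            rows.append({"id": fields[0], "text": fields[1], "normalized": normalized})
    return rows


if __name__ == "__main__":
    # Paths are assumptions based on the file listing above.
    archive = Path("LJ-Speech-Dataset/data/LJSpeech-1.1.tar.bz2")
    if archive.exists():
        root = extract_archive(archive, "extracted")
        metadata_files = sorted(root.rglob("metadata.csv"))
        if metadata_files:
            clips = read_metadata(metadata_files[0])
            print(f"Loaded {len(clips)} transcriptions")
```

Searching for `metadata.csv` with `rglob` avoids hard-coding the top-level directory name inside the archive, which may differ between releases.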