HyperAI

TED-LIUM English Speech Recognition Training Corpus

TED-LIUM is a speech recognition training corpus from TED lectures, with transcribed, 16kHz audio clips, containing approximately 118 hours of lectures in total.

The dataset was created in 2012 by the University of Maine Laboratory for Computer Science (LIUM).

Main publishers: A. Rousseau, P. Deléglise, and Y. Estève

TED-LIUM.torrent
Seeding 2Downloading 0Completed 922Total Downloads 1,304
  • TED-LIUM/
    • README.md
      899 字节
    • README.txt
      1.76 KB
      • data/
        • TEDLIUM_release1.tar.gz
          2.83 GB
        • TEDLIUM_release1.tar.gz.1
          17.62 GB
        • TEDLIUM_release1.tar.gz.2
          37.45 GB
        • wget-log
          37.45 GB
        • wget-log.1
          37.48 GB