HyperAI

Clotho Audio Subtitles Dataset

Date

3 years ago

Size

4.36 GB

Organization

Tampere University

Publish URL

zenodo.org

License

其他

特色图像

Clotho is an audio captioning dataset. The dataset focuses on the content of audio and the diversity of captions, and consists of 4,981 audio samples, each with 5 captions (24,905 captions in total), with a duration of 15 to 30 seconds and a caption length of 8 to 20 words.

Clotho.torrent
Seeding 1Downloading 1Completed 544Total Downloads 587
  • Clotho/
    • README.md
      1.03 KB
    • README.txt
      2.06 KB
      • data/
        • LICENSE
          3.88 KB
        • clotho_audio_development.7z
          3.2 GB
        • clotho_audio_evaluation.7z
          4.36 GB
        • clotho_captions_development.csv
          4.36 GB
        • clotho_captions_evaluation.csv
          4.36 GB
        • clotho_metadata_development.csv
          4.36 GB
        • clotho_metadata_evaluation.csv
          4.36 GB