CSS10 Speech Dataset
Date
3 years ago

CSS10 is a dataset of single-speaker speech in ten languages. It consists of short audio clips taken from LibriVox audiobooks, each paired with its aligned text. To verify the quality of the data, the researchers also trained two neural text-to-speech models on it. The dataset is intended for use in speech tasks such as text-to-speech.
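As a rough illustration of how clip and text pairs like these might be read, the sketch below parses a transcript listing. It assumes a pipe-delimited layout of audio path, original text, normalized text, and duration in seconds on each line; that layout, the `Clip` type, the `parse_transcript` helper, and the sample lines are all assumptions made for this example and should be checked against the actual files before use.

```python
from typing import Iterable, List, NamedTuple


class Clip(NamedTuple):
    """One audio clip and its aligned text (hypothetical record type)."""
    audio_path: str
    text: str
    duration_s: float


def parse_transcript(lines: Iterable[str]) -> List[Clip]:
    """Parse lines of the assumed form: path|original text|normalized text|duration."""
    clips = []
    for line in lines:
        line = line.rstrip("\n")
        if not line:
            continue  # skip blank lines
        fields = line.split("|")
        if len(fields) != 4:
            raise ValueError(f"expected 4 pipe-separated fields, got {len(fields)}: {line!r}")
        path, _original, normalized, duration = fields
        clips.append(Clip(path, normalized, float(duration)))
    return clips


# Made-up sample lines, not taken from the dataset.
sample = [
    "book/clip_0001.wav|Hello, world.|hello world|2.5",
    "book/clip_0002.wav|Chapter 1.|chapter one|4.0",
]

clips = parse_transcript(sample)
total_seconds = sum(clip.duration_s for clip in clips)
print(len(clips), "clips,", total_seconds, "seconds of audio")
```

In practice the lines would come from an open file, for example `parse_transcript(open(path, encoding="utf-8"))`, where `path` points at a transcript file in the assumed layout.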