Date

a year ago

Size

3.06 GB

Paper URL

arxiv.org

License

Apache 2.0

Tags

Audio Classification

Text-to-Audio

NonverbalTTS is a dataset for generating speech with nonverbal vocalizations, released by VK Lab and Yandex in 2025. The accompanying paper, "NonverbalTTS: A Public English Corpus of Text-Aligned Nonverbal Vocalizations with Emotion Annotations for Text-to-Speech", aims to advance expressive text-to-speech (TTS) research and to help models generate natural speech that conveys emotion and includes nonverbal sounds. The dataset contains 17 hours of high-quality speech from 2,296 speakers (60% male, 40% female), covering 10 types of nonverbal vocalization (breathing, laughing, sighing, sneezing, coughing, throat clearing, groaning, grunting, snoring, and inhaling) and 8 emotion categories (anger, disgust, fear, happiness, neutral, sadness, surprise, and other).
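As a rough illustration of how the two label sets described above might be checked when loading annotations, here is a minimal sketch. The label strings are taken from the description; the `Annotation` record and its field names (`speaker_id`, `emotion`, `nonverbal`) are hypothetical and may not match the actual metadata columns.

```python
from dataclasses import dataclass
from typing import Tuple

# Label sets copied from the dataset description.
NONVERBAL_TYPES = frozenset({
    "breathing", "laughing", "sighing", "sneezing", "coughing",
    "throat clearing", "groaning", "grunting", "snoring", "inhaling",
})
EMOTIONS = frozenset({
    "anger", "disgust", "fear", "happiness",
    "neutral", "sadness", "surprise", "other",
})


@dataclass(frozen=True)
class Annotation:
    """Hypothetical record layout; the real column names may differ."""
    speaker_id: str
    emotion: str
    nonverbal: Tuple[str, ...]

    def __post_init__(self):
        # Reject labels outside the documented vocabularies.
        if self.emotion not in EMOTIONS:
            raise ValueError(f"unknown emotion: {self.emotion!r}")
        unknown = [t for t in self.nonverbal if t not in NONVERBAL_TYPES]
        if unknown:
            raise ValueError(f"unknown nonverbal types: {unknown}")
```

Validating at construction time surfaces a mismatch between these assumed strings and the real annotation values as soon as the first record is loaded.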

Dataset features:

- Multi-source data: derived from the VoxCeleb and Expresso corpora
- Rich metadata: emotion tags, nonverbal vocalization annotations, speaker IDs, audio quality metrics
- Sampling rate: 16 kHz for audio from VoxCeleb, 48 kHz for audio from Expresso
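Because the two source corpora use different sampling rates, mixing them in one training set usually means resampling to a common rate. The sketch below only computes the integer up/down factors and the resulting clip length. The 16 kHz target and the function names are assumptions made for illustration, not something the dataset prescribes.

```python
from fractions import Fraction

# Sampling rates as stated in the feature list.
SOURCE_RATES_HZ = {"VoxCeleb": 16_000, "Expresso": 48_000}


def resample_plan(source, target_hz=16_000):
    """Return (up, down) integer factors for converting a source to target_hz."""
    ratio = Fraction(target_hz, SOURCE_RATES_HZ[source])
    return ratio.numerator, ratio.denominator


def resampled_length(n_samples, source, target_hz=16_000):
    """Number of samples after resampling, rounded up (an assumed convention)."""
    up, down = resample_plan(source, target_hz)
    return -(-n_samples * up // down)  # ceiling division
```

The `(up, down)` pair can be passed to a polyphase resampler such as `scipy.signal.resample_poly(x, up, down)`. With a 16 kHz target, Expresso audio is downsampled by a factor of 3 and VoxCeleb audio is left unchanged.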

Citation

@inproceedings{borisov25_ssw,
  title     = {{NonverbalTTS: A Public English Corpus of Text-Aligned Nonverbal Vocalizations with Emotion Annotations for Text-to-Speech}},
  author    = {Maksim Borisov and Egor Spirin and Daria Diatlova},
  year      = {2025},
  booktitle = {{13th edition of the Speech Synthesis Workshop}},
  pages     = {104--109},
  doi       = {10.21437/SSW.2025-16},
}

NonverbalTTS.torrent

Seeding: 1 | Downloading: 0 | Completed: 75 | Total downloads: 168

NonverbalTTS/
- README.md
  1.77 KB
- README.txt
  3.55 KB

This dataset is contributed by community users and is intended for educational and informational purposes only. If any content involves copyright infringement, please contact us at [email protected] for prompt review and removal.

Related Datasets

MAKIEVAL Multilingual Cultural Knowledge Assessment Dataset

37 minutes ago

SMOL Multilingual Translation Parallel Dataset

a month ago

Emotion-probes Emotion Detection Dataset

2 months ago
