Date

8 months ago

Size

55.66 MB

Organization

Paper URL

2509.22727

License

CC BY 4.0

Tags

Multimodal

Audio and Speech Processing

Synthesis

Audio Recognition

Text-to-Speech

DiaMoE-TTS is a speech dataset for multi-dialect text-to-speech (TTS) tasks, released in 2025 by Tsinghua University in collaboration with Giant Interactive. The related research paper is titled "...".DiaMoE-TTS: A Unified IPA-Based Dialect TTS Framework with Mixture-of-Experts and Parameter-Efficient Zero-Shot AdaptationThe goal is to build a unified dialect phonetic representation system to support transferable speech modeling and zero-shot dialect synthesis research across multiple dialects. This dataset is built upon multiple open-source dialect speech resources and employs IPA (International Phonetic Alphabet) as a unified phonetic representation system for consistent phonological annotation across different dialect corpora. The speech sources include the Common Voice Cantonese dataset, the Emilia Mandarin corpus, dialect speech from the KeSpeech corpus, and the open-source Minnan (Hokkien) speech dataset. During data processing, all speech samples underwent a unified phoneme-level phonetic conversion, constructing an IPA front-end annotation sequence that can be aligned across dialects.

DiaMoE-TTS.torrent

Seeding 1Downloading 0Completed 4Total Downloads 99

DiaMoE-TTS/
- README.md
  1.74 KB
- README.txt
  3.47 KB

This dataset is contributed by community users and is intended for educational and informational purposes only. If any content involves copyright infringement, please contact us at [email protected] for prompt review and removal.

Related Datasets

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

HyperAI

Use this Dataset

Discuss on Discord

Date

8 months ago

Size

55.66 MB

Organization

Paper URL

2509.22727

License

CC BY 4.0

Related Datasets

Nemotron-Math-v2 Mathematical Inference Dataset

a day ago

GroundingME Complex Scene Understanding Evaluation Dataset

a day ago

MCIF Multimodal Cross-Language Instruction Following Dataset

6 months ago

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Command Palette

DiaMoE-TTS Multi-Dialect Speech Phonetic Dataset

Build AI with AI

HyperAI Newsletters

Command Palette

DiaMoE-TTS Multi-Dialect Speech Phonetic Dataset

Related Datasets

Nemotron-Math-v2 Mathematical Inference Dataset

GroundingME Complex Scene Understanding Evaluation Dataset

MCIF Multimodal Cross-Language Instruction Following Dataset

Build AI with AI

HyperAI Newsletters

Command Palette

DiaMoE-TTS Multi-Dialect Speech Phonetic Dataset

Related Datasets

Nemotron-Math-v2 Mathematical Inference Dataset

GroundingME Complex Scene Understanding Evaluation Dataset

MCIF Multimodal Cross-Language Instruction Following Dataset

Build AI with AI

HyperAI Newsletters

Related Datasets

Nemotron-Math-v2 Mathematical Inference Dataset

GroundingME Complex Scene Understanding Evaluation Dataset

MCIF Multimodal Cross-Language Instruction Following Dataset

Related Datasets

Nemotron-Math-v2 Mathematical Inference Dataset

GroundingME Complex Scene Understanding Evaluation Dataset

MCIF Multimodal Cross-Language Instruction Following Dataset