2 months ago

Yuhang Dai Ziyu Zhang Shuai Wang Longhao Li Zhao Guo Tianlun Zuo Shuiyuan Wang Hongfei Xue Chengyou Wang Qing Wang

Abstract

The scarcity of large-scale, open-source data for dialects severely hinders progress in speech technology, a challenge particularly acute for the widely spoken Sichuanese dialects of Chinese. To address this critical gap, we introduce WenetSpeech-Chuan, a 10,000-hour, richly annotated corpus constructed using our novel Chuan-Pipeline, a complete data processing framework for dialectal speech. To facilitate rigorous evaluation and demonstrate the corpus's effectiveness, we also release high-quality ASR and TTS benchmarks, WenetSpeech-Chuan-Eval, with manually verified transcriptions. Experiments show that models trained on WenetSpeech-Chuan achieve state-of-the-art performance among open-source systems and demonstrate results comparable to commercial services. As the largest open-source corpus for Sichuanese dialects, WenetSpeech-Chuan not only lowers the barrier to research in dialectal speech processing but also plays a crucial role in promoting AI equity and mitigating bias in speech technologies. The corpus, benchmarks, models, and receipts are publicly available on our project page.

Source PDF

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

HyperAI

2 months ago

Audio and Speech Processing

Yuhang Dai Ziyu Zhang Shuai Wang Longhao Li Zhao Guo Tianlun Zuo Shuiyuan Wang Hongfei Xue Chengyou Wang Qing Wang

Abstract

Source PDF

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

HyperAI

2 months ago

Audio and Speech Processing

Yuhang Dai Ziyu Zhang Shuai Wang Longhao Li Zhao Guo Tianlun Zuo Shuiyuan Wang Hongfei Xue Chengyou Wang Qing Wang

Abstract

Source PDF

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Command Palette

WenetSpeech-Chuan: A Large-Scale Sichuanese Corpus with Rich Annotation for Dialectal Speech Processing

Yuhang Dai Ziyu Zhang Shuai Wang Longhao Li Zhao Guo Tianlun Zuo Shuiyuan Wang Hongfei Xue Chengyou Wang Qing Wang6 more

Abstract

Build AI with AI

HyperAI Newsletters

Command Palette

WenetSpeech-Chuan: A Large-Scale Sichuanese Corpus with Rich Annotation for Dialectal Speech Processing

Yuhang Dai Ziyu Zhang Shuai Wang Longhao Li Zhao Guo Tianlun Zuo Shuiyuan Wang Hongfei Xue Chengyou Wang Qing Wang6 more

Abstract

Build AI with AI

HyperAI Newsletters

Command Palette

WenetSpeech-Chuan: A Large-Scale Sichuanese Corpus with Rich Annotation for Dialectal Speech Processing

Yuhang Dai Ziyu Zhang Shuai Wang Longhao Li Zhao Guo Tianlun Zuo Shuiyuan Wang Hongfei Xue Chengyou Wang Qing Wang6 more

Abstract

Build AI with AI

HyperAI Newsletters

Yuhang Dai Ziyu Zhang Shuai Wang Longhao Li Zhao Guo Tianlun Zuo Shuiyuan Wang Hongfei Xue Chengyou Wang Qing Wang

Yuhang Dai Ziyu Zhang Shuai Wang Longhao Li Zhao Guo Tianlun Zuo Shuiyuan Wang Hongfei Xue Chengyou Wang Qing Wang

Yuhang Dai Ziyu Zhang Shuai Wang Longhao Li Zhao Guo Tianlun Zuo Shuiyuan Wang Hongfei Xue Chengyou Wang Qing Wang