Date

2 years ago

1. Functional Description

Note: The one-click training I made currently only supports Chinese. If you want to train Japanese or English, you need to enable webui.

The method is to change the python run_all.py in the run.ipynb running code to python webui.py

2. Video Tutorial

https://www.bilibili.com/video/BV1WC411W79t

3. Operation method

1. Open run.ipynb

Click Run -> Run All Cells to start the program, automatically configure the environment, and start the service.

2. Open the output public URL

3. Choose the data type according to your audio

4. Click to start training

Click to see which step the process has reached in the foreground, and you can also see the log output in the background.

5. Open the API address

When the front end shows that prediction is being turned on

Open API address:

6. Voice cloning

Select the trained model, enter your text, and have fun.

4. Custom audio

1. Find data sets and create new data sets

2. Upload audio data

3. Modify the configuration and start

4. Bind a new input address

5. Open the workspace

In this way, you can see the newly bound data set in the sidebar on the right.

6. Training to fill in the newly bound address

This notebook is contributed by community users and is intended for educational and informational purposes only. If any content involves copyright infringement, please contact us at [email protected] for prompt review and removal.

Related Notebooks

MarkItDown, Microsoft's open-source Document Conversion Tool

3 months ago

SoulX-Podcast: Podcast-quality long-text Speech Generation for Multiple dialects.

3 months ago

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

HyperAI

Run this Notebook

Date

2 years ago

1. Functional Description

Note: The one-click training I made currently only supports Chinese. If you want to train Japanese or English, you need to enable webui.

The method is to change the python run_all.py in the run.ipynb running code to python webui.py

2. Video Tutorial

https://www.bilibili.com/video/BV1WC411W79t

3. Operation method

1. Open run.ipynb

Click Run -> Run All Cells to start the program, automatically configure the environment, and start the service.

2. Open the output public URL

3. Choose the data type according to your audio

4. Click to start training

Click to see which step the process has reached in the foreground, and you can also see the log output in the background.

5. Open the API address

When the front end shows that prediction is being turned on

Open API address:

6. Voice cloning

Select the trained model, enter your text, and have fun.

4. Custom audio

1. Find data sets and create new data sets

2. Upload audio data

3. Modify the configuration and start

4. Bind a new input address

5. Open the workspace

In this way, you can see the newly bound data set in the sidebar on the right.

6. Training to fill in the newly bound address

Related Notebooks

MarkItDown, Microsoft's open-source Document Conversion Tool

3 months ago

SoulX-Podcast: Podcast-quality long-text Speech Generation for Multiple dialects.

3 months ago

Stable Diffusion Online Tutorial - RTX5090

3 months ago

Long-VITA: A Multimodal Understanding Demo With Millions of Tokens

3 months ago

Nemotron-Speech-Streaming-ASR: Automatic Speech Recognition Demo

2 months ago

TRELLIS.2 3D Generation Demo

2 months ago

TVM Tutorial 0.22.0

2 months ago

Triton Compiler Tutorial

2 months ago

Qwen3-TTS: High-Quality Controllable Multilingual Speech Synthesis Demo

25 days ago

VibeVoice-ASR: Multifunctional End-to-End Speech Recognition Demo

21 days ago

ACE-Step 1.5: Music Generation Demo

21 days ago

CPU Deployment of gpt-oss-20b-GGUF

19 days ago

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Command Palette

GPT-SoVITS Audio Synthesis Online Demo

1. Functional Description

2. Video Tutorial

3. Operation method

4. Custom audio

Build AI with AI

HyperAI Newsletters

Command Palette

GPT-SoVITS Audio Synthesis Online Demo

1. Functional Description

2. Video Tutorial

3. Operation method

4. Custom audio

Related Notebooks

MarkItDown, Microsoft's open-source Document Conversion Tool

SoulX-Podcast: Podcast-quality long-text Speech Generation for Multiple dialects.

Stable Diffusion Online Tutorial - RTX5090

Long-VITA: A Multimodal Understanding Demo With Millions of Tokens

Nemotron-Speech-Streaming-ASR: Automatic Speech Recognition Demo

TRELLIS.2 3D Generation Demo

TVM Tutorial 0.22.0

Triton Compiler Tutorial

Qwen3-TTS: High-Quality Controllable Multilingual Speech Synthesis Demo

VibeVoice-ASR: Multifunctional End-to-End Speech Recognition Demo

ACE-Step 1.5: Music Generation Demo

CPU Deployment of gpt-oss-20b-GGUF

Build AI with AI

HyperAI Newsletters

Command Palette

GPT-SoVITS Audio Synthesis Online Demo

1. Functional Description

2. Video Tutorial

3. Operation method

4. Custom audio

Related Notebooks

MarkItDown, Microsoft's open-source Document Conversion Tool

SoulX-Podcast: Podcast-quality long-text Speech Generation for Multiple dialects.

Stable Diffusion Online Tutorial - RTX5090

Long-VITA: A Multimodal Understanding Demo With Millions of Tokens

Nemotron-Speech-Streaming-ASR: Automatic Speech Recognition Demo

TRELLIS.2 3D Generation Demo

TVM Tutorial 0.22.0

Triton Compiler Tutorial

Qwen3-TTS: High-Quality Controllable Multilingual Speech Synthesis Demo

VibeVoice-ASR: Multifunctional End-to-End Speech Recognition Demo

ACE-Step 1.5: Music Generation Demo

CPU Deployment of gpt-oss-20b-GGUF

Build AI with AI

HyperAI Newsletters

Related Notebooks

MarkItDown, Microsoft's open-source Document Conversion Tool

SoulX-Podcast: Podcast-quality long-text Speech Generation for Multiple dialects.

Stable Diffusion Online Tutorial - RTX5090

Long-VITA: A Multimodal Understanding Demo With Millions of Tokens

Nemotron-Speech-Streaming-ASR: Automatic Speech Recognition Demo

TRELLIS.2 3D Generation Demo

TVM Tutorial 0.22.0

Triton Compiler Tutorial

Qwen3-TTS: High-Quality Controllable Multilingual Speech Synthesis Demo

VibeVoice-ASR: Multifunctional End-to-End Speech Recognition Demo

ACE-Step 1.5: Music Generation Demo

CPU Deployment of gpt-oss-20b-GGUF

Related Notebooks

MarkItDown, Microsoft's open-source Document Conversion Tool

SoulX-Podcast: Podcast-quality long-text Speech Generation for Multiple dialects.

Stable Diffusion Online Tutorial - RTX5090

Long-VITA: A Multimodal Understanding Demo With Millions of Tokens

Nemotron-Speech-Streaming-ASR: Automatic Speech Recognition Demo

TRELLIS.2 3D Generation Demo

TVM Tutorial 0.22.0

Triton Compiler Tutorial

Qwen3-TTS: High-Quality Controllable Multilingual Speech Synthesis Demo

VibeVoice-ASR: Multifunctional End-to-End Speech Recognition Demo

ACE-Step 1.5: Music Generation Demo

CPU Deployment of gpt-oss-20b-GGUF