Date

2 years ago

1. Functional Description

Note: The one-click training I made currently only supports Chinese. If you want to train Japanese or English, you need to enable webui.

The method is to change the python run_all.py in the run.ipynb running code to python webui.py

2. Video Tutorial

https://www.bilibili.com/video/BV1WC411W79t

3. Operation method

1. Open run.ipynb

Click Run -> Run All Cells to start the program, automatically configure the environment, and start the service.

2. Open the output public URL

3. Choose the data type according to your audio

4. Click to start training

Click to see which step the process has reached in the foreground, and you can also see the log output in the background.

5. Open the API address

When the front end shows that prediction is being turned on

Open API address:

6. Voice cloning

Select the trained model, enter your text, and have fun.

4. Custom audio

1. Find data sets and create new data sets

2. Upload audio data

3. Modify the configuration and start

4. Bind a new input address

5. Open the workspace

In this way, you can see the newly bound data set in the sidebar on the right.

6. Training to fill in the newly bound address

This notebook is contributed by community users and is intended for educational and informational purposes only. If any content involves copyright infringement, please contact us at [email protected] for prompt review and removal.

Notebook Overview

Level

Beginner

Topic

Audio Generative AI

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

HyperAI

Run this Notebook Discuss on Discord

Date

2 years ago

1. Functional Description

Note: The one-click training I made currently only supports Chinese. If you want to train Japanese or English, you need to enable webui.

The method is to change the python run_all.py in the run.ipynb running code to python webui.py

2. Video Tutorial

https://www.bilibili.com/video/BV1WC411W79t

3. Operation method

1. Open run.ipynb

Click Run -> Run All Cells to start the program, automatically configure the environment, and start the service.

2. Open the output public URL

3. Choose the data type according to your audio

4. Click to start training

Click to see which step the process has reached in the foreground, and you can also see the log output in the background.

5. Open the API address

When the front end shows that prediction is being turned on

Open API address:

6. Voice cloning

Select the trained model, enter your text, and have fun.

4. Custom audio

1. Find data sets and create new data sets

2. Upload audio data

3. Modify the configuration and start

4. Bind a new input address

5. Open the workspace

In this way, you can see the newly bound data set in the sidebar on the right.

6. Training to fill in the newly bound address

Notebook Overview

Level

Beginner

Topic

Audio Generative AI

Related Notebooks

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Command Palette

GPT-SoVITS Audio Synthesis Online Demo

1. Functional Description

2. Video Tutorial

3. Operation method

4. Custom audio

Notebook Overview

Build AI with AI

HyperAI Newsletters

Command Palette

GPT-SoVITS Audio Synthesis Online Demo

1. Functional Description

2. Video Tutorial

3. Operation method

4. Custom audio

Notebook Overview

Related Notebooks

Phi-4-reasoning-vision-15B Multimodal Reasoning Vision Model Demo

CPU Deployment of gpt-oss-20b-GGUF

ACE-Step 1.5: Music Generation Demo

VibeVoice-ASR: Multifunctional End-to-End Speech Recognition Demo

Qwen3-TTS: High-Quality Controllable Multilingual Speech Synthesis Demo

Build AI with AI

HyperAI Newsletters

Command Palette

GPT-SoVITS Audio Synthesis Online Demo

1. Functional Description

2. Video Tutorial

3. Operation method

4. Custom audio

Notebook Overview

Related Notebooks

Phi-4-reasoning-vision-15B Multimodal Reasoning Vision Model Demo

CPU Deployment of gpt-oss-20b-GGUF

ACE-Step 1.5: Music Generation Demo

VibeVoice-ASR: Multifunctional End-to-End Speech Recognition Demo

Qwen3-TTS: High-Quality Controllable Multilingual Speech Synthesis Demo

Build AI with AI

HyperAI Newsletters

Related Notebooks

Phi-4-reasoning-vision-15B Multimodal Reasoning Vision Model Demo

CPU Deployment of gpt-oss-20b-GGUF

ACE-Step 1.5: Music Generation Demo

VibeVoice-ASR: Multifunctional End-to-End Speech Recognition Demo

Qwen3-TTS: High-Quality Controllable Multilingual Speech Synthesis Demo

Related Notebooks

Phi-4-reasoning-vision-15B Multimodal Reasoning Vision Model Demo

CPU Deployment of gpt-oss-20b-GGUF

ACE-Step 1.5: Music Generation Demo

VibeVoice-ASR: Multifunctional End-to-End Speech Recognition Demo

Qwen3-TTS: High-Quality Controllable Multilingual Speech Synthesis Demo