HyperAI

Generate High-quality Articles in 5 Seconds, Llama 3-Chinese-Chat Demo Starts With One Click!


Llama 3 has attracted wide attention since its release and has been hailed as "the best open-source large model to date." However, its Chinese support still has problems. For example, when asked a question in Chinese, it often replies in English or in a mix of Chinese and English, which is inconvenient.

Ask Llama 3-8B-Instruct in Chinese

To address this problem, Lai Xinlu from the Artificial Intelligence Department of Henan University of Technology developed the first Chinese version of Llama 3, the Llama 3-Chinese-Chat model, which converses fluently in Chinese and has strong Chinese comprehension.
The "One-click deployment of Llama 3-Chinese-Chat-8B Demo" tutorial is now available on the HyperAI official website. You can start a conversation with one click, without entering any commands.

Ask Llama 3-Chinese-Chat-8B a question in Chinese

Tutorial address:

https://go.hyper.ai/ocpwC

Demo Run

1. Log in to hyper.ai, and on the Public Tutorials page, select One-click deployment of Llama 3-Chinese-Chat-8B Demo. Click Run this tutorial online.

2. After the page redirects, click "Clone" in the upper right corner to clone the tutorial into your own container.

3. Click "Next: Select Hashrate" in the lower right corner.

4. After the page redirects, select "NVIDIA GeForce RTX 4090" and click "Next: Review". New users can register using the invitation link below to get 4 hours of RTX 4090 time plus 5 hours of free CPU time!

HyperAI exclusive invitation link (copy and open in browser): https://openbayes.com/console/signup?r=6bJ0ljLFsFh_Vvej

5. Click "Continue" and wait for resources to be allocated. The first clone takes about 3 minutes. When the status changes to "Running", hover over "API Address", copy the address, and open it in a new tab to reach the Llama 3-Chinese-Chat-8B Demo page. Please note that users must complete real-name authentication before using the API address access function.

If the container stays in the "Allocating resources" state for more than 10 minutes, try stopping and restarting it. If restarting does not resolve the issue, please contact the platform customer service on the official website.

Effect Demonstration

1. Open the Llama 3-Chinese-Chat-8B Demo page, enter text in the dialog box, and click Send to start the conversation.

2. Send it the topic and requirements of this year's college entrance examination essay (national paper), and it quickly outputs an essay that meets the requirements.

3. The hyperparameters in the panel on the left are:

* Maximum length: The maximum number of tokens that the model can output;

* Top P: Controls the range of candidate tokens selected from the probability distribution output by the model. Only the most probable tokens whose cumulative probability reaches this value are considered, so a larger value means that a larger set of tokens is considered during text generation.

* Temperature: A hyperparameter that controls randomness. The larger the value, the more random the generated text will be.

* Repetition Penalty: Controls repetition in the generated text. A higher value means fewer occurrences of the same phrase or word in the text.
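
To make the sampling hyperparameters above more concrete, here is a minimal Python sketch of how Temperature, Top P, and Repetition Penalty can reshape a next-token distribution. It is an illustration only, not the demo's actual implementation: the function names and the toy logits are hypothetical, and the repetition penalty uses one common convention (dividing positive logits and multiplying negative logits by the penalty), which is an assumption about how the demo behaves. Maximum length is not shown; it simply caps how many times such a sampling step is repeated.

```python
import math

def apply_repetition_penalty(logits, generated_ids, penalty):
    # Assumed convention: make already-generated tokens less likely by
    # dividing positive logits and multiplying negative logits by the penalty.
    out = list(logits)
    for i in set(generated_ids):
        out[i] = out[i] / penalty if out[i] > 0 else out[i] * penalty
    return out

def softmax_with_temperature(logits, temperature):
    # Higher temperature flattens the distribution (more random output);
    # lower temperature sharpens it (more deterministic output).
    scaled = [x / temperature for x in logits]
    peak = max(scaled)
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def top_p_filter(probs, top_p):
    # Keep the smallest set of most probable tokens whose cumulative
    # probability reaches top_p, then renormalise over that set.
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    kept, cumulative = [], 0.0
    for i in order:
        kept.append(i)
        cumulative += probs[i]
        if cumulative >= top_p:
            break
    mass = sum(probs[i] for i in kept)
    return {i: probs[i] / mass for i in kept}

def next_token_distribution(logits, generated_ids,
                            temperature=1.0, top_p=1.0, repetition_penalty=1.0):
    penalised = apply_repetition_penalty(logits, generated_ids, repetition_penalty)
    probs = softmax_with_temperature(penalised, temperature)
    return top_p_filter(probs, top_p)

# Toy vocabulary of four tokens (hypothetical logits); token 0 was already generated.
logits = [2.0, 1.0, 0.5, -1.0]
dist = next_token_distribution(logits, generated_ids=[0],
                               temperature=0.7, top_p=0.9, repetition_penalty=1.2)
print(dist)  # maps each surviving token id to its renormalised probability
```

Raising `top_p` toward 1.0 lets more low-probability tokens survive the filter, while raising `repetition_penalty` above 1.0 lowers the score of tokens that have already appeared.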