HyperAI

Weekly Editor's Picks | 24/7 AI Live Room Is Here! One-click Use of GPT-SoVITS Voice Cloning Tutorial Is New

特色图像

📢📢 Good news! Good news! 📺「Super Neuro TV」 is now online on bilibili~

"Super Neuro TV" will broadcast practical courses and industry conferences in the field of AI "7×24h" non-stop.Every week we will publish a preview of next week’s program at the end of the article!

Click to see:http://live.bilibili.com/26483094

Whispering,Karpathy, a technical expert who left OpenAI some time ago The AI course "Let's Build a GPT Tokenizer" will be broadcast in the live broadcast room at 8 pm next Friday (March 15).In order to make it easier for everyone to experience the project, the project-related resources minbpe have also been launched on the hyper.ai official website. Come and watch and learn~

In addition, this week the hyper.ai official website has launched multiple text sentiment analysis datasets for everyone to download and use!

From March 4th to March 8th, hyper.ai official website updated quickly:

* High-quality public datasets: 10

* Community article selection: 4 articles

* Popular encyclopedia entries: 10

Visit the official website:hyper.ai

Selected public datasets

1. minbpe repository

This repository is Karpathy's minbpe project repository, which contains class-based Tokenizer implementations such as BaseTokenizer and BasicTokenizer. These classes are designed to provide basic functions for training, encoding and decoding, as well as utility functions such as saving and loading.

Direct use:

https://reurl.cc/97A8QY

2. D4 Chinese Dialogue Dataset for Depression Diagnosis Chat

D4 is a Chinese dialogue dataset for depression diagnosis chat. It is a new type of dialogue called task-based chat. The dataset consists of 1,339 multi-turn dialogues, including dialogue summaries and diagnosis results.

Direct use:

https://reurl.cc/krbAk3

3. dmsc_v2 movie review dataset

This dataset contains more than 2 million ratings or comments on 28 movies from more than 700,000 users. It is suitable for tasks such as recommendation systems, sentiment/opinion/comment analysis, etc.

Direct use:

https://reurl.cc/prbEOl

4. Amazon Amazon user review dataset

The dataset contains 7.2 million reviews or ratings from 1.42 million users on 520,000 products in more than 1,100 categories on Amazon, and can be used for tasks such as recommendation systems and sentiment/opinion/review tendency analysis.

Direct use:

https://reurl.cc/97A8AV

5. Dianping Dianping Dataset

This dataset contains 4.4 million reviews or ratings from 540,000 users on 240,000 restaurants. It can be used for tasks such as recommendation systems, sentiment/opinion/review tendency analysis, etc.

Direct use:

https://reurl.cc/eLbYrR

6. Weibo_senti_100k Sina sentiment microblog dataset

The dataset contains more than 100,000 Sina Weibo posts with sentiment annotations, including approximately 50,000 positive and negative comments.

Direct use:

https://reurl.cc/N487a6

7. Online_shopping_10_cats 10 categories of online shopping review dataset

This dataset contains online shopping reviews in 10 categories (books, tablets, mobile phones, fruits, shampoo, water heaters, Mengniu, clothes, computers, and hotels), with a total of more than 60,000 reviews, including about 30,000 positive and negative reviews. It can be used for sentiment/opinion/comment analysis tasks.

Direct use:

https://reurl.cc/qrbl8g

8. ChnSentiCorp Ctrip Hotel Review Dataset

The dataset contains more than 7,000 hotel review data from Ctrip.com, including more than 5,000 positive reviews and more than 2,000 negative reviews.

Direct use:

https://reurl.cc/QelvjM

9. Waimai_10k Food Delivery Platform User Evaluation Dataset

This dataset is from user reviews collected by a food delivery platform, including about 4,000 positive reviews and about 8,000 negative reviews. It can be used for sentiment/opinion/comment tendency analysis.

Direct use:

https://reurl.cc/K4zaQq

10. Four-shapes Four-shape dataset

The dataset contains 16,000 images of 4 shapes: square, star, circle, and triangle. Each image is 200×200 pixels. The data was collected using a Garmin Virb 1080p action camera.

Direct use:

https://reurl.cc/E4N9Zm

For more updated datasets this week, please visit:

https://hyper.ai/datasets

Community Articles

1. The accuracy rate reaches 91.74%! Southeast University proposed a photovoltaic cell defect detection model and introduced neural structure search for the first time

In order to simultaneously meet the accuracy and speed requirements of defect detection in the industrial field, Professor Zhang Jinxia's team from the School of Automation at Southeast University proposed an automatic defect detection model for photovoltaic cell electroluminescent images based on neural structure search and knowledge distillation, with an accuracy rate of 91.74%. Currently, the research results have been published on arXiv.

View the full report:

https://reurl.cc/97AK7x

2. Tutorial included: Voice cloning model GPT-SoVITS, 5 seconds of speech can clone a voice with a similarity of 95%

The founder of RVC (Retrieval based Voice Conversion) (GitHub account: RVC-Boss) has open-sourced a voice cloning project GPT-SoVITS, which has gained great popularity since its launch. With only a 5-second voice sample, you can get a cloned voice with a similarity of 80%~95%. Currently, the model deployment tutorial has been launched on the HyperAI official website.

Click to start cloning:

https://reurl.cc/97AKj8

3. During the two sessions, technology leaders mentioned AI4S/talent training/computing power/large models/industry general data sets...

It’s the “two sessions” again! The government work report clearly proposed to launch the “artificial intelligence +” action, and NPC deputies and CPPCC members from all walks of life also made suggestions on AI. This article sorts out the proposals and suggestions of some big names in the technology circle.

View the full report:

https://reurl.cc/lgbxGE

4. HyperAI Super Neural x DT New Materials丨Application of AI Technology in Chemistry and Materials Science Research Seminar Opens Registration

The "Seminar on the Application of AI Technology in Chemistry and Materials Science Research" will be held in Zhengzhou, Henan from April 23 to 25. Many industry experts will share their in-depth views on the challenges, prospects and opportunities of applying AI in the field of materials chemistry.

Interested developers can click the link below to sign up.

Registration link:

https://reurl.cc/QelMdb

Popular Encyclopedia Articles

1. Laplacian

2. World Modeling

3. Frames Per Second (FPS)

4. Diffusion Model

5. Human-in-the-loop (HITL)

Here are hundreds of AI-related terms compiled to help you understand "artificial intelligence" here:

https://hyper.ai/wiki

Super Neuro B Station Live Streaming Preview

datetimecontent
March 11
Monday
10:00NVIDIA's press conferences over the years
Tuesday, March 1210:0018:00Machine Learning Compilation Course Tesla AI Day
Wednesday, March 1310:00Google IO conferences over the years
Thursday, March 1410:00Stanford HAI Symposium
Friday, March 1510:0020:00MIT Deep Learning Course Let's build GPT Tokenizer
Saturday, March 1610:00Harvard CS50 Course (2023)
Sunday, March 1710:00Harvard CS50 Course (2022)

Super Neuro TV broadcasts live 24/7. Click to get the "electronic pickles" in the AI field:

http://live.bilibili.com/26483094

The above is all the content of this week’s editor’s selection. If you have resources that you want to include on the hyper.ai official website, you are also welcome to leave a message or submit an article to tell us!

See you next week!

About HyperAI

HyperAI (hyper.ai) is the leading artificial intelligence and high-performance computing community in China.We are committed to becoming the infrastructure in the field of data science in China and providing rich and high-quality public resources for domestic developers. So far, we have:

* Provide domestic accelerated download nodes for 1200+ public data sets

* Includes 300+ classic and popular online tutorials

* Interpretation of 100+ AI4Science paper cases

* Support 500+ related terms search

* Hosting the first complete Apache TVM Chinese documentation in China

Visit the official website to start your learning journey:

https://hyper.ai/