HyperAI

The First Hyperspectral Salient Object Detection Benchmark Dataset Is Online; Participate in the Creator Incentive Program and Win a 100% Cash Prize!

特色图像

This week, OpenAI and Google successively launched nuclear bomb-level products in press conferences. OpenAI still likes to grab the news spotlight as usual, and released GPT-4o before the Google I/O conference. And Google also frequently launched big moves to confront OpenAI head-on.

In terms of product releases this week, OpenAI and Google's products each have their own advantages in the field of artificial intelligence. HyperAI has summarized the key points of the two conferences for you, come and check it out!

From May 13th to May 17th, hyper.ai official website updates:

* High-quality public datasets: 10

* Selection of high-quality tutorials: 3

* Community article selection: 4 articles

* Popular encyclopedia entries: 5

Visit the official website:hyper.ai

Selected public datasets

1. DVQA Image Question Answering Dataset

DVQA is a dataset for testing bar graph understanding in a question-answering framework, covering a large number of common bar graph styles and containing approximately 3.48 million question-answer pairs corresponding to 300,000 images.

Direct use:https://go.hyper.ai/p4183

2. Exoplanets exoplanet dataset

The dataset contains information about 4,856 exoplanets, such as name, mass, year of discovery, distance from Earth, etc.

Direct use:https://go.hyper.ai/OcA3h

3. Safebooru Anime Image Metadata Set

Safebooru is a tag-based image archive maintained by anime fans, containing 3 million lines of tag-based anime image metadata. It allows users to post images and add tags, annotations, translations, and comments.

Direct use: https://go.hyper.ai/IyCRb

4. Marvel Universe Social Network Dataset

This dataset contains Marvel heroes, comics, and the relationships between them. It can be used to study the social network in Marvel hero comics.

Direct use: https://go.hyper.ai/9GQuu

5. Marvel Comic Books

This dataset contains information about all comic books ever published in the Marvel Universe, such as name, author, active year, and price.

Direct use: https://go.hyper.ai/dRBXp

6. Star Wars Movie Scripts Dataset

Star Wars is a popular movie series that takes place in a galaxy far, far away. There are 9 movies in the series. This dataset contains a collection of script dialogues between characters from movies 4-6. This data can be used for text mining and linguistics.

Direct use: https://go.hyper.ai/7g25U

7. WIE World Important Events Dataset

The dataset spans important historical milestones from ancient times to modern times, covering a wide range of global events. It provides a comprehensive timeline of events that shaped the world, offering insights into wars, cultural shifts, technological advances, and social movements.

Direct use: https://go.hyper.ai/Dw3tT

8. HSOD-BIT-V1 Hyperspectral Salient Object Detection Benchmark Dataset

HSOD-BIT is the first large-scale, high-quality hyperspectral salient object detection benchmark dataset, which aims to achieve higher accuracy in salient object detection tasks by leveraging the advantages of spectral information. Aiming at the data requirements of contemporary deep learning models, the dataset provides pixel-level manual annotations for 319 hyperspectral data cubes and generates corresponding pseudo-color images.

Direct use:https://go.hyper.ai/HGnjm

9. AFHQ Animal Face Image Dataset

The dataset consists of 16,130 high-quality images with a resolution of 512×512, including animal face images of three categories: cats, dogs, and wild animals.

Direct use:https://go.hyper.ai/AbByr

10. WikiArt Faces Artistic Faces Dataset

The dataset contains hundreds of face images in 26 styles selected from WikiArt works, with varying degrees of realism and stylization. The images are not of uniform size or aspect ratio, so some preprocessing (padding, scaling, etc.) is required for most applications.

Direct use:https://go.hyper.ai/OO5RX

For more public datasets, please visit:

https://hyper.ai/datasets

Selected Public Tutorials

1. Online Tutorial | Turing Award winner Yann LeCun praised! Xiaohongshu open-sources InstantID, allowing you to customize multiple styles of photos with just one original photo

InstantID is an image generation solution based on a diffusion model that can quickly generate a variety of stylized portraits from a single reference image. Just upload a selfie and get a customized AI portrait in 20 seconds.

Run online:https://go.hyper.ai/Eq4zl

2. ComfyUI InstantID Workflow Online Tutorial

ComfyUI InstantID workflow can transform ordinary portrait photos into works of art with highly personalized style. In the process of style transfer, the algorithm combines the content of one image with the style of another image to generate a new image that retains the content of the original image and has the artistic style of the other image. This technology is widely used in fields such as art creation, image processing and visual effects.

Run online: https://go.hyper.ai/djKs3

3. Preprocessing audio data using the GTZAN dataset

This tutorial will explain and demonstrate all possible features extracted from audio data that can aid in model training using the GTZAN dataset.

Run online:https://go.hyper.ai/9Teed 

We have also established a Stable Diffusion tutorial exchange group. Welcome friends to join the group to discuss various technical issues and share application results~

Community Articles

1. GPT-4o is a dragon-slaying shocker! Multimodal, real-time interactive, free for all, smooth voice interactive ChatGPT is a winner

OpenAI released its latest flagship model GPT-4o on Tuesday, which is free for all users! The model provides GPT-4-level AI capabilities, accepts any combination of text, audio, and image input, and can generate any combination of text, audio, and image output. Based on GPT-4o, the upgraded ChatGPT supports real-time interaction, and the reaction time is similar to that of human conversation. For more first-hand reports and model effect demonstrations, please see this article.

Get detailed information:https://go.hyper.ai/7SZEG

2. Google Gemini is here! Strengthening the search moat and empowering the whole family, Gemini 1.5 Pro is upgraded to 2 million tokens

This article summarizes several highlights of the Google I/O conference: Gemini 1.5 Pro was upgraded to 2 million tokens, the lightweight model Gemini 1.5 Flash was launched, and Google Search was integrated with AI Overviews to provide information overviews and assist users in life planning. For more first-hand reports, please see this article.

View the full report:https://go.hyper.ai/vqptC

3. You can get up to 300 yuan in cash! Share your practical experience and you can easily get rewards

The second phase of the OpenBayes Bayesian computing platform "Creator Incentive Program" is now online. Come and join HyperAI! Whether you are a veteran in deep learning, data science or large models, or a beginner; whether you are a veteran with rich creative experience, or a newcomer to the world of social media; as long as you are willing to share, you will get rewards, and the chance of winning is 100%!

Full event information:https://go.hyper.ai/3uBMM

4. Don’t worry about space objects “colliding with the Earth”! The Chinese Academy of Sciences team proposed a new method for identifying coronal mass ejections, which is close to human identification results

A research team from the Key Laboratory of Solar Activity and Space Weather at the National Space Science Center of the Chinese Academy of Sciences proposed a CME identification and parameter acquisition method based on machine learning. Compared with the manual CME catalog and some classic automatic identification methods, this method is efficient and fast, can identify relatively weak CME signals, and give accurate CME morphology information. This article is a detailed interpretation and sharing of the research.

View the full report:https://go.hyper.ai/0PFQD

Popular Encyclopedia Articles

1. Recall Rate

2. Masked Language Modeling (MLM)

3. Extended Long Short-Term Memory xLSTM

4. Kolmogorov-Arnold Representation Theorem

5. Kolmogorov-Arnold Networks

Here are hundreds of AI-related terms compiled to help you understand "artificial intelligence" here:

https://hyper.ai/wiki

Station B live broadcast preview

Apple will hold WWDC 2024 from June 10th to 14th. To help you learn more about Apple,The Super Neuro B Station live broadcast room will continue to broadcast the "Apple Special" video from next week.Involves: Past WWDC conferences, executive interviews, related documentaries and other rich content.

The following table is a preview of the content selected by the editor↓↓↓

datetimecontent
May 20
Monday
18:00Steve Jobs
Tuesday, May 2118:00Jobs Interview Collection 1990
Wednesday, May 2218:00Interview with Steve Jobs vs Bill Gates
Thursday, May 2318:00iPhone first release
Friday, May 2418:00History of Steve Jobs
Saturday, May 2518:00How Apple survived nearly bankruptcy
Sunday, May 2618:00Tim Cook's History

Super Neuro TV broadcasts live 24/7. Click to get the "electronic pickles" in the AI field:

http://live.bilibili.com/26483094

The above is all the content of this week’s editor’s selection. If you have resources that you want to include on the hyper.ai official website, you are also welcome to leave a message or submit an article to tell us!

See you next week!

About HyperAI

HyperAI (hyper.ai) is the leading artificial intelligence and high-performance computing community in China.We are committed to becoming the infrastructure in the field of data science in China and providing rich and high-quality public resources for domestic developers. So far, we have:

* Provide domestic accelerated download nodes for 1200+ public data sets

* Includes 300+ classic and popular online tutorials

* Interpretation of 100+ AI4Science paper cases

* Support 500+ related terms search

* Hosting the first complete Apache TVM Chinese documentation in China

Visit the official website to start your learning journey:

https://hyper.ai