HyperAI

Microsoft and Tencent Compete in Technology, TRELLIS Leads the New Direction of Multi-format Support in the Field of 3D Generation; More Than 5k Questions Correct! VIS-Bench Allows AI to Learn "spatial Memory"

特色图像

Last November, Tencent launched the Hunyuan3D generative model, which is the industry's first open source large model that supports 3D generation from both text and images. Less than a month later, Microsoft released a new framework, TRELLIS, to join the competition in the field of 3D asset generation. TRELLIS supports multiple formats of output, including radiation field, 3D Gaussian and mesh, providing maximum flexibility for different needs.

Both models are now available on the hyper.ai official website. Come and try to see which one is better.

Use Hunyuan3D online:https://go.hyper.ai/Rsrno

Using TRELLIS online:https://go.hyper.ai/JE5s5

From January 6 to January 11, hyper.ai official website updates:

* High-quality public datasets: 10

* High-quality tutorial selection: 6

* Community Article Selection: 8 articles

* Popular encyclopedia entries: 5

* Top conferences with deadlines in January: 7

Visit the official website:hyper.ai

Selected public datasets

1. VSI-Bench Visual Spatial Intelligence Benchmark

The dataset contains more than 5k question-answer pairs, covering nearly 290 real indoor scene videos, involving a variety of environments such as residences, offices, and factories, and covering multiple issues such as object recognition, position relationship, and action prediction.

Direct use:https://go.hyper.ai/q0DYA

VSI-Bench Task Demonstration

2. Facial Feature Extraction Dataset Facial Feature Extraction Dataset

This dataset is a labeled dataset containing 750 images for detecting eyebrows, eyes, nose, lips, and beard regions of the face. The data labeling process was performed in Roboflow and exported from in YOLOv8 format.

Direct use:https://go.hyper.ai/O3kER

Dataset Example

3. Sentiment and Emotion Analysis Dataset Sentiment and Emotion Analysis Dataset

The dataset contains 422,000 sentiment analysis sentences and 3,309 emotion analysis sentences as supplements. The sentiment analysis is labeled with 6 different emotions: joy, sadness, anger, fear, love, and surprise.

Direct use:https://go.hyper.ai/wFNO6

4. Eurus-2-RL-Data Mathematical Programming Problem Training Dataset

This dataset is a high-quality dataset specifically used for reinforcement learning training. It is mainly used to solve mathematical and programming problems. It contains about 455k mathematical problems and 27k programming problems.

Direct use:https://go.hyper.ai/Wdo1k

5. Medical o1 Reasoning SFT Medical reasoning dataset

This dataset is designed for fine-tuning the HuatuoGPT-o1 medical language model to improve its performance in complex medical reasoning tasks. The construction of the dataset relies on GPT-4o, which ensures the accuracy and reliability of the data by searching for verifiable medical questions and verifying the answers using a medical verifier.

Direct use:https://go.hyper.ai/XMtXp

6. MCTS Chinese Text Simplified Dataset

The dataset contains 723 complex structured sentences selected from news corpus based on the Penn Chinese Treebank (CTB) standard, and each sentence is equipped with multiple manually simplified versions, making it the largest and most referenced evaluation dataset for the Chinese text simplification task.

Direct use:https://go.hyper.ai/UR3CN

7. educhat-sft-002-data-osm Educational dialogue dataset

The dataset contains 4 million data points, covering a variety of educational verticals, such as open question-and-answer, essay grading, heuristic teaching, emotional support, and course tutoring.

Direct use:https://go.hyper.ai/nQw0K

8. GOAT arithmetic task fine-tuning dataset

This dataset has two files: dataset.json and dataset.ipynb. The dataset.json file contains about 1.7 million synthetic data generated by dataset.ipynb for arithmetic tasks.

Direct use:https://go.hyper.ai/8ZAvG

9. NaturalProofs Mathematical Reasoning Dataset

The dataset is a multi-domain corpus for studying mathematical reasoning in natural language. It contains approximately 30k theorem statements and proofs, 15k definitions, and 2k additional pages (e.g., axioms, corollaries), all written in natural mathematical language.

Direct use:https://go.hyper.ai/Bk4WE

10. TransGPT-pt&sft traffic dialogue pre-training dataset

This dataset is part of TransGPT, the first comprehensive transportation model in China. It contains about 346,000 pieces of text data in the transportation field, which are used for pre-training in the field, and about 58,000 pieces of dialogue data in the transportation field, which are used for fine-tuning.

Direct use:https://go.hyper.ai/vuDHa

Selected Public Tutorials

1. Hunyuan3D: Generate 3D assets in just 10 seconds

Hunyuan3D is a 3D generative diffusion model, including a lightweight version and a standard version, both of which support the generation of high-quality 3D assets from text and image inputs. After qualitative and quantitative multi-dimensional evaluation, Hunyuan3D-1.0 performs very well in terms of geometric details, texture details, texture-geometry consistency, 3D rationality, and instruction compliance.

This tutorial is a lightweight version of Hunyuan3D. Click the link below and follow the tutorial instructions to experience 3D model generation.

Run online:https://go.hyper.ai/Rsrno

Effect examples

2.  TRELLIS: Microsoft's open source 3D asset generation model demo

TRELLIS is a graph neural network-based interpretability framework developed by the Microsoft team in 2024. It aims to provide efficient model interpretability by learning the characteristics of graph structured data.

The model and environment have been deployed. You can use the large model to convert images into 3D images according to the tutorial instructions.

Run online:https://go.hyper.ai/JE5s5

Effect examples

3.Quick deployment of ChatGLM2-6b-32k

ChatGLM-6B is an open-source, bilingual Chinese-English conversational language model based on the General Language Model (GLM) architecture with 6.2 billion parameters. Combined with model quantization technology, users can perform local (INT4 quantization) on consumer-grade graphics cards with as little as 6 GB of video memory.

Follow the tutorial steps and directly copy the generated API address to use ChatGLM-6B.

Run online:https://go.hyper.ai/B0b7V

Model web interface

4. Natural Language Processing with NLTK

NLTK is one of the most popular platforms for creating Python programs that work with natural language data. In addition to text processing libraries for classification, tokenization, stemming, tagging, parsing, and semantic reasoning, it provides simple interfaces to more than 50 large structured text datasets (corpora) and lexical resources.

This tutorial shows how to use NLTK to perform various NLP operations at the text processing stage and create a Keras model with the help of some NLTK tools for sentiment analysis text classification.

Run online:https://go.hyper.ai/BFZ10

Word frequency distribution example

5. Audio LDM Audio Editing Tutorial

AudioLDM is a latent text-to-audio diffusion model capable of generating realistic audio samples given any textual input. AudioLDM takes a textual prompt as input and predicts the corresponding audio. It can generate text-conditioned sound effects, human speech, and music.

This project can generate a front-end interactive interface through the Gradio interface. The relevant models and dependencies have been deployed. Click the link below to edit the audio.

Run online:https://go.hyper.ai/BCOWL

6. ShowUI: A vision-language-action model focusing on GUI automation

The ShowUI model understands the content of the screen interface and performs interactive actions such as clicking, inputting, scrolling, etc. It supports web and mobile application scenarios and can automatically complete complex user interface tasks. ShowUI can parse screenshots and user instructions to predict interactive actions on the interface.

This tutorial is a one-click deployment demo of the model. You only need to clone and start the container and directly copy the generated API address to experience the model.

Run online:https://go.hyper.ai/reHs7

Model web interface

💡We have also established a Stable Diffusion tutorial exchange group. Welcome friends to scan the QR code and remark [SD tutorial] to join the group to discuss various technical issues and share application results~

Community Articles

1. A collection of 10 great Christmas datasets: popular songs/gift wish lists/recipes/10 years of Christmas Twitter copywriting…

During the Christmas season, HyperAI has prepared 10 Christmas-related data sets for you, which support online use and accelerated downloading. Come and experience them.

View the dataset summary:https://go.hyper.ai/if7Lc

2. Costs dropped 100 times! Nonprofit E11 Bio's new results map the connections between millions of cells in the brain

Biotech company E11 Bio has launched PRISM technology, which can map the connections between millions of cells in the entire brain at a very low cost. This series of innovations is expected to reduce the overall cost of whole-brain connectomics by at least 100 times, making it possible to explore the human brain in the future. This article is a detailed report on the company, click to read it quickly.

View the full report:https://go.hyper.ai/ISc4j

3. Multinational policies/scientific breakthroughs/pioneers/enterprise layout...all the major events of AI4S industry in 2024 in one article

HyperAI has compiled and reviewed the high-impact events in the field of AI for Science in 2024. Click here to view the detailed report.

View the full report:https://go.hyper.ai/d2Dlv

4. Global warming is difficult to reverse. Stanford team uses AI to predict the highest temperature changes, with a record-breaking probability of 90%

Against the backdrop of continued global warming, rare extreme climate events have begun to occur frequently. Research teams from Stanford University, Colorado State University, and ETH Zurich used artificial intelligence convolutional neural network systems to predict global warming and found that even if we can achieve rapid emissions reductions, global temperatures are still likely to continue to rise by 90%. This article is a detailed interpretation and sharing of the paper.

View the full report:https://go.hyper.ai/vDt3e

5. Purdue University team achieves data-efficient tactile representation for robot learning by simulating human reactive grasping

Xu Zhengtong, a third-year doctoral student at Purdue University, shared his two major research results, the reactive grasping controller LeTac-MPC and UniT for unified tactile representation of robots, with the topic of "Data-efficient tactile representation for robot learning". This article is a summary of the sharing content, click to read it quickly.

View the full report:https://go.hyper.ai/IPIjj

6. Five battles for CASP, a benchmark for protein structure prediction, Zheng Wei from Nankai University: competitiveness and difficulty increase, focus on practical biological problems

CASP has always been regarded by the industry as a benchmark for protein structure prediction. In this context, HyperAI had the honor of having an in-depth interview with Professor Zheng Wei. Through CASP, an international competition with industry benchmark significance, he analyzed the current development trend in the field of protein structure prediction for us. Full of dry goods, click to read quickly.

View the full report:https://go.hyper.ai/Y83iz

7. After 30 years of perseverance, MIT aims at the next generation of lithium batteries and uses generative AI to achieve a major breakthrough in solid electrolytes

In a recent study, a research team from MIT and Toyota Research Institute deeply studied the complexity of different advanced generative models in polymer generation and proposed a de novo design method that can continuously generate and evaluate new polymer electrolytes based on GPT and diffusion models, providing new candidates for experimental verification. This article is a detailed interpretation and sharing of the paper.

View the full report:https://go.hyper.ai/PDc8J

8. Just now! The official price of the world's fastest GPU RTX 5090 is over 14,000 yuan, and the price of the castrated version of 5090 D in China is over 16,000 yuan

At the CES 2025 conference, Nvidia officially announced the RTX 5090, which is the fastest GeForce RTX GPU to date. The initial price is 14,000 yuan, and the Chinese version 5090 D is priced at 16,000 yuan. The RTX 5090 also adds FP4 support, which takes up less memory space and runs generative AI models 2 times faster than the previous generation. This article is a detailed introduction to the product, click to read it quickly.

View the full report:https://go.hyper.ai/dyyZS

Popular Encyclopedia Articles

1. Nuclear Norm

2. Paired t-Test

3. Large-scale Multi-task Language Understanding (MMLU)

4. Sigmoid function

5. The least square method

Here are hundreds of AI-related terms compiled to help you understand "artificial intelligence" here:

https://go.hyper.ai/wiki

January deadline for the top conference

One-stop tracking of top AI academic conferences:https://go.hyper.ai/event

The above is all the content of this week’s editor’s selection. If you have resources that you want to include on the hyper.ai official website, you are also welcome to leave a message or submit an article to tell us!

See you next week!

About HyperAI

HyperAI (hyper.ai) is the leading artificial intelligence and high-performance computing community in China.We are committed to becoming the infrastructure in the field of data science in China and providing rich and high-quality public resources for domestic developers. So far, we have:

* Provide domestic accelerated download nodes for 1700+ public data sets

* Includes 500+ classic and popular online tutorials

* Interpretation of 200+ AI4Science paper cases

* Supports 600+ related terms search

* Hosting the first complete Apache TVM Chinese documentation in China

Visit the official website to start your learning journey:

https://hyper.ai

Finally, I recommend a "Creator Incentive Program". Interested friends can scan the QR code to participate!