HyperAIHyperAI

Command Palette

Search for a command to run...

ByteDance open-sources Lance, a 3B Model Encompassing Understanding, Generation, and Editing; the National University of Singapore Proposes the ViMU Dataset: Covering 588 Videos and non-verbal Question answering.

Featured Image

Lance, released by ByteDance in 2026, is a native unified multimodal model. Employing a 3B activity parameter design, it can simultaneously perform image and video understanding, generation, and editing within a single framework. This model achieves capability sharing across text, image, and video tasks through unified multimodal representation and multi-task collaborative training. Its core utilizes a two-stream hybrid expert (MoE) architecture and modality-aware rotational position encoding (MaPE), achieving unified context learning on shared interleaved multimodal sequences while cleverly decoupling the capability paths of understanding and generation. Combined with a phased multi-task training strategy, Lance significantly surpasses existing open-source unified models in terms of image and video generation quality while maintaining excellent multimodal semantic understanding capabilities.

The HyperAI website now features "Lance: Unifying Multimodal Understanding, Generation, and Editing Models," so come and try it out!

Online use:https://go.hyper.ai/Okkmw

Welcome to visit our official website for more information:

https://hyper.ai

A quick overview of hyper.ai's official website updates from May 23rd to May 29th:

* High-quality public datasets: 3

* Selection of high-quality tutorials: 3

* Community article interpretation: 3 articles

* Popular encyclopedia entries: 5

Visit the official website:hyper.ai

Selected public datasets

1. ViMU Video Metaphor Understanding Dataset

ViMU is a benchmark dataset for video metaphor understanding released by the National University of Singapore in 2026. It aims to evaluate the ability of multimodal large models to understand deep semantic meanings of video metaphors.

Online use:https://go.hyper.ai/0DIpe

2. Rice Leaf Diseases Dataset

Rice Leaf Disease Detection is a rice leaf image dataset specifically designed for precision agriculture target detection tasks. It is widely used in applications such as YOLO model training, agricultural disease detection, edge vision deployment, and intelligent rice planting management. This dataset contains 8,665 rice leaf images, covering 9 categories, including healthy rice leaves and 8 common diseases: bacterial leaf blight, brown spot, rice leaf roller damage, rice blast, leaf scorch, leaf smut, narrow brown spot, and neck blast.

Online use:https://go.hyper.ai/IXOlY

Dataset Example

3. MRI Brain Neurodegenerative Diseases Dataset

MRI Brain Neurodegenerative Diseases is an MRI dataset designed for research and medical image analysis of neurodegenerative diseases of the brain. It is widely used in research areas such as disease classification, medical image recognition, and deep learning model training. The dataset contains 2,846 brain MRI images with a resolution of 512 × 512, organized according to two imaging weights and four main categories.

Online use:https://go.hyper.ai/VpFoh

Dataset Example

Selected Public Tutorials

1. Lance: A unified model for multimodal understanding, generation, and editing.

Lance, released by ByteDance in 2026, is a 3B-scale native unified multimodal model designed for tasks such as image understanding, video understanding, text-to-image generation, text-to-video generation, image editing, and video editing. Lance's key feature is that it processes understanding, generation, and editing within the same model framework, enabling text, image, and video tasks to share a unified multimodal representation. It can generate images or videos from text, perform visual editing by combining input images, input videos, and text instructions, and perform question answering, description, and reasoning on images and videos.

Run online:https://go.hyper.ai/Okkmw

Demo Page

2. HY-World-2.0 World Model

HY-World-2.0 is a multimodal world model framework launched by Tencent in 2026. Unlike world models that only generate pixel videos (such as Genie 3 and Cosmos), HY-World-2.0 directly generates realistic 3D assets (mesh/3DGS), which are editable, persistent, and can be directly imported into game engines such as Blender, Unity, and Unreal Engine.

Run online:https://go.hyper.ai/ZQpHM

Demo Page

3. AutoFigure: An LLM-based automatic figure generation system for academic papers

AutoFigure is an intelligent academic illustration generation system developed by the ResearchAI team at Westlake University and published at ICLR 2026. This system utilizes a large language model (LM) through an iterative optimization mechanism to automatically generate high-quality scientific illustrations that meet publication standards from text descriptions or research papers. It supports both SVG vector graphics and mxGraph XML (fully compatible with draw.io) output formats.

Run online:https://go.hyper.ai/ZrWS4

Demo Page

Community article interpretation

1. CVEvolve, a zero-code, self-discovery scientific image processing algorithm proposed by Argonne National Laboratory, possesses full-stack capabilities including coding, result self-checking, and strategy optimization.

A research team at Argonne National Laboratory (ANL) in the United States has developed a zero-code autonomous agent framework called CVEvolve after systematically analyzing past AI-based automated work. This framework is designed to discover algorithms needed for scientific data processing. It possesses strong versatility, requiring no pre-defined problem architecture or fixed process templates. It can achieve closed-loop linkage of various elements such as code, data, evaluation metrics, retrieval records, and visualization results, supporting the development of executable algorithms for computer vision, image processing, and other fields.

View the full report:https://go.hyper.ai/UBS5q

2. In just 30 minutes, the biological multi-agent Robin successfully integrated 550 research papers, establishing an autonomous research loop and identifying dAMD candidate therapies.

A joint team from FutureHouse in San Francisco, the University of Oxford, and Fordham University has proposed the Robin biological multi-agent system. This is the first biomedical intelligent system to simultaneously integrate scientific hypothesis generation and experimental data analysis capabilities, achieving a continuous closed-loop workflow.

View the full report:https://go.hyper.ai/KnYpQ

3. Scientists have independently generated novel materials by reverse-engineering gallium-containing materials using a Bayesian optimization framework. The optimization results exhibit uniqueness and novelty.

A research team led by Flinders University in collaboration with Khalifa University in the UAE has proposed a machine learning-guided Bayesian optimization (BO) framework that enables the reverse design of gallium-based compositions with predetermined electronic properties while maintaining chemical rationality. Analytical results after optimization show that the generated material possesses 100% uniqueness and novelty relative to the training data, and that the SMACT effectiveness is significantly improved within the 1.5–2.5 eV bandgap range.

View the full report:https://go.hyper.ai/kXS7f

Popular Encyclopedia Articles

1. HyperNetworks

2. Confusion Matrix

3. Prompt Engineering

4. Learning While Deploying

5. Reciprocal Rank Fusion

Here are hundreds of AI-related terms compiled to help you understand "artificial intelligence" here:

https://go.hyper.ai/wiki

The above is all the content of this week’s editor’s selection. If you have resources that you want to include on the hyper.ai official website, you are also welcome to leave a message or submit an article to tell us!

See you next week!

About HyperAI

HyperAI (hyper.ai) is the leading artificial intelligence and high-performance computing community in China.We are committed to becoming the infrastructure in the field of data science in China and providing rich and high-quality public resources for domestic developers. So far, we have:

* Provides domestic accelerated download nodes for 2100+ public datasets

* Includes 700+ classic and popular online tutorials

* Analyzing 300+ AI4Science Paper Cases

* Supports searching for 700+ related terms

* Hosting the first complete Apache TVM Chinese documentation in China

Visit the official website to start your learning journey:

https://hyper.ai