4 months ago

Abstract

We introduce Lumina-DiMOO, an open-source foundational model for seamlessmulti-modal generation and understanding. Lumina-DiMOO sets itself apart fromprior unified models by utilizing a fully discrete diffusion modeling to handleinputs and outputs across various modalities. This innovative approach allowsLumina-DiMOO to achieve higher sampling efficiency compared to previousautoregressive (AR) or hybrid AR-Diffusion paradigms and adeptly support abroad spectrum of multi-modal tasks, including text-to-image generation,image-to-image generation (e.g., image editing, subject-driven generation, andimage inpainting, etc.), as well as image understanding. Lumina-DiMOO achievesstate-of-the-art performance on multiple benchmarks, surpassing existingopen-source unified multi-modal models. To foster further advancements inmulti-modal and discrete diffusion model research, we release our code andcheckpoints to the community. Project Page:https://synbol.github.io/Lumina-DiMOO.

Source PDF View Code

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

4 months ago

Yi Xin Qi Qin Siqi Luo Kaiwen Zhu Juncheng Yan Yan Tai Jiayi Lei Yuewen Cao Keqi Wang Yibin Wang

Abstract

Source PDF View Code

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

4 months ago

Yi Xin Qi Qin Siqi Luo Kaiwen Zhu Juncheng Yan Yan Tai Jiayi Lei Yuewen Cao Keqi Wang Yibin Wang

Abstract

Source PDF View Code

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Command Palette

Lumina-DiMOO: An Omni Diffusion Large Language Model for Multi-Modal Generation and Understanding

Yi Xin Qi Qin Siqi Luo Kaiwen Zhu Juncheng Yan Yan Tai Jiayi Lei Yuewen Cao Keqi Wang Yibin Wang22 more

Abstract

Build AI with AI

HyperAI Newsletters

Command Palette

Lumina-DiMOO: An Omni Diffusion Large Language Model for Multi-Modal Generation and Understanding

Yi Xin Qi Qin Siqi Luo Kaiwen Zhu Juncheng Yan Yan Tai Jiayi Lei Yuewen Cao Keqi Wang Yibin Wang22 more

Abstract

Build AI with AI

HyperAI Newsletters

Command Palette

Lumina-DiMOO: An Omni Diffusion Large Language Model for Multi-Modal Generation and Understanding

Yi Xin Qi Qin Siqi Luo Kaiwen Zhu Juncheng Yan Yan Tai Jiayi Lei Yuewen Cao Keqi Wang Yibin Wang22 more

Abstract

Build AI with AI

HyperAI Newsletters

Yi Xin Qi Qin Siqi Luo Kaiwen Zhu Juncheng Yan Yan Tai Jiayi Lei Yuewen Cao Keqi Wang Yibin Wang

Yi Xin Qi Qin Siqi Luo Kaiwen Zhu Juncheng Yan Yan Tai Jiayi Lei Yuewen Cao Keqi Wang Yibin Wang

Yi Xin Qi Qin Siqi Luo Kaiwen Zhu Juncheng Yan Yan Tai Jiayi Lei Yuewen Cao Keqi Wang Yibin Wang