8 months ago

Abstract

Rapid progress in text-to-motion generation has been largely driven by diffusion models. However, existing methods focus solely on temporal modeling, thereby overlooking frequency-domain analysis. We identify two key phases in motion denoising: the semantic planning stage and the fine-grained improving stage. To address these phases effectively, we propose Frequency enhanced text-to-motion diffusion model (Free-T2M), incorporating stage-specific consistency losses that enhance the robustness of static features and improve fine-grained accuracy. Extensive experiments demonstrate the effectiveness of our method. Specifically, on StableMoFusion, our method reduces the FID from 0.189 to 0.051, establishing a new SOTA performance within the diffusion architecture. These findings highlight the importance of incorporating frequency-domain insights into text-to-motion generation for more precise and robust results.
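The abstract does not say how the frequency-domain analysis is carried out, so the following is only a minimal sketch of what separating a motion sequence into low- and high-frequency parts could look like. It assumes a motion clip stored as a `(frames, features)` NumPy array and uses a plain FFT along the time axis; the function name `split_frequency_bands` and the `keep_bins` cutoff are hypothetical and are not taken from the paper.

```python
import numpy as np


def split_frequency_bands(motion, keep_bins):
    """Split a (frames, features) array into low- and high-frequency parts.

    Hypothetical helper for illustration only; not the Free-T2M implementation.
    """
    # Real-valued FFT along the time axis (axis 0).
    spectrum = np.fft.rfft(motion, axis=0)

    # Keep only the lowest `keep_bins` frequency bins, zero out the rest.
    low_spectrum = np.zeros_like(spectrum)
    low_spectrum[:keep_bins] = spectrum[:keep_bins]

    # Back to the time domain; the residual is the high-frequency part.
    low = np.fft.irfft(low_spectrum, n=motion.shape[0], axis=0)
    high = motion - low
    return low, high


# Toy signal: a slow component (1 cycle per clip) plus a small fast one (20 cycles).
frames = 64
t = np.arange(frames) / frames
slow = np.sin(2 * np.pi * 1 * t)
fast = 0.1 * np.sin(2 * np.pi * 20 * t)
motion = np.stack([slow + fast, slow - fast], axis=1)

low, high = split_frequency_bands(motion, keep_bins=4)
```

In this toy case the low band recovers the slow component and the high band the fast detail, which is one way to picture a coarse part of a motion versus its fine-grained part. How Free-T2M actually uses such bands in its consistency losses is described in the paper itself, not here.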


Chen Wenshuo ; Jia Haozhe ; Lai Songning ; Wu Keming ; Xiao Hongru ; Hu Lijie ; Yue Yutao


Free-T2M: Frequency Enhanced Text-to-Motion Diffusion Model With Consistency Loss
