HyperAIHyperAI

Command Palette

Search for a command to run...

BAD: Bidirectional Auto-regressive Diffusion for Text-to-Motion Generation

Seyed Rohollah Hosseyni Ali Ahmad Rahmani Seyed Jamal Seyedmohammadi Sanaz Seyedin Arash Mohammadi

Abstract

Autoregressive models excel in modeling sequential dependencies by enforcingcausal constraints, yet they struggle to capture complex bidirectional patternsdue to their unidirectional nature. In contrast, mask-based models leveragebidirectional context, enabling richer dependency modeling. However, they oftenassume token independence during prediction, which undermines the modeling ofsequential dependencies. Additionally, the corruption of sequences throughmasking or absorption can introduce unnatural distortions, complicating thelearning process. To address these issues, we propose BidirectionalAutoregressive Diffusion (BAD), a novel approach that unifies the strengths ofautoregressive and mask-based generative models. BAD utilizes apermutation-based corruption technique that preserves the natural sequencestructure while enforcing causal dependencies through randomized ordering,enabling the effective capture of both sequential and bidirectionalrelationships. Comprehensive experiments show that BAD outperformsautoregressive and mask-based models in text-to-motion generation, suggesting anovel pre-training strategy for sequence modeling. The codebase for BAD isavailable on https://github.com/RohollahHS/BAD.


Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing

HyperAI Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
BAD: Bidirectional Auto-regressive Diffusion for Text-to-Motion Generation | Papers | HyperAI