3 months ago

Kefei Zhu Fengshuo Bai YuanHao Xiang Yishuai Cai Xinglin Chen Ruochong Li Xingtao Wang Hao Dong Yaodong Yang Xiaopeng Fan

Abstract

Dexterous manipulation is critical for advancing robot capabilities in real-world applications, yet diverse and high-quality datasets remain scarce. Existing data collection methods rely on human teleoperation, require significant human engineering, or generate data with limited diversity, which restricts their scalability and generalization. In this paper, we introduce DexFlyWheel, a scalable data generation framework that employs a self-improving cycle to continuously enrich data diversity. Starting from an efficient warm-up on seed demonstrations, DexFlyWheel expands the dataset through iterative cycles. Each cycle follows a closed-loop pipeline that integrates Imitation Learning (IL), residual Reinforcement Learning (RL), rollout trajectory collection, and data augmentation. Specifically, IL extracts human-like behaviors from demonstrations, and residual RL enhances policy generalization. The learned policy is then used to generate trajectories in simulation, which are further augmented across diverse environments and spatial configurations before being fed back into the next cycle. Over successive iterations, a self-improving data flywheel effect emerges, producing datasets that cover diverse scenarios and thereby scaling policy performance. Experimental results demonstrate that DexFlyWheel generates over 2,000 diverse demonstrations across four challenging tasks. Policies trained on our dataset achieve an average success rate of 81.9% on the challenge test sets and successfully transfer to the real world through a digital twin, achieving a 78.3% success rate on dual-arm lift tasks.
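The closed-loop cycle described in the abstract can be pictured with a small toy sketch. It is not the authors' implementation: the function `run_flywheel`, the four stage callables, and the integer "scenario" representation are all hypothetical stand-ins chosen for illustration. The sketch also assumes that the dataset accumulates across iterations, which the abstract does not state explicitly.

```python
from typing import Callable, FrozenSet, List, Sequence, Tuple

# Hypothetical toy types: a "scenario" is just an integer id, and a
# "policy" is the set of scenarios it can solve. Real systems would use
# trajectories and neural network policies instead.
Scenario = int
Policy = FrozenSet[Scenario]


def run_flywheel(
    seed_demos: Sequence[Scenario],
    train_il: Callable[[List[Scenario]], Policy],
    train_residual_rl: Callable[[Policy], Policy],
    rollout: Callable[[Policy], List[Scenario]],
    augment: Callable[[List[Scenario]], List[Scenario]],
    num_iterations: int,
) -> Tuple[List[Scenario], List[int]]:
    """Run IL -> residual RL -> rollout -> augmentation for several cycles."""
    dataset = sorted(set(seed_demos))
    sizes = [len(dataset)]
    for _ in range(num_iterations):
        base_policy = train_il(dataset)          # imitate current data
        policy = train_residual_rl(base_policy)  # refine on top of the base
        trajectories = rollout(policy)           # collect simulated rollouts
        new_data = augment(trajectories)         # vary environment / layout
        # Assumption: new data is merged into the existing dataset.
        dataset = sorted(set(dataset) | set(new_data))
        sizes.append(len(dataset))
    return dataset, sizes


# Toy stand-ins for the four stages (all hypothetical):
def toy_il(data: List[Scenario]) -> Policy:
    return frozenset(data)  # the base policy covers exactly what it saw


def toy_residual_rl(base: Policy) -> Policy:
    return base | frozenset(s + 1 for s in base)  # generalize to neighbors


def toy_rollout(policy: Policy) -> List[Scenario]:
    return sorted(policy)  # one successful trajectory per covered scenario


def toy_augment(trajs: List[Scenario]) -> List[Scenario]:
    return trajs + [t + 10 for t in trajs]  # replay in a shifted environment


dataset, sizes = run_flywheel(
    [0], toy_il, toy_residual_rl, toy_rollout, toy_augment, num_iterations=3
)
print(sizes)  # [1, 4, 9, 16]
```

In this toy setting a single seed scenario grows to 16 covered scenarios after three cycles, which mirrors the flywheel idea in miniature: each stage widens coverage a little, and feeding the result back compounds the gain.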

DexFlyWheel: A Scalable and Self-improving Data Generation Framework for Dexterous Manipulation
