Command Palette
Search for a command to run...
DexFlyWheel Data Generation Framework
DexFlyWheel was proposed in September 2025 by Peking University, Harbin Institute of Technology, and PsiBot, and the relevant research results were published in the paper "DexFlyWheel: A Scalable and Self-improving Data Generation Framework for Dexterous Manipulation", was accepted as Spotlight by NeurIPS 2025.
DexFlyWheel is a scalable data generation framework employing a self-improving loop to continuously enrich data diversity. The framework has two key design features: IL + Residual RL for generating human-like and diverse data. Specifically, IL and Residual RL, combined with policy unrolling and data augmentation, form a self-improving loop. In each iteration, the policy generates trajectories, which are then enhanced in increasingly diverse scenarios and subsequently fed into the next iteration. This loop creates a flywheel effect, progressively expanding data diversity, enhancing policy generalization capabilities, and evolving into a robust, generalizable data generation agent.
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.