Date

3 months ago

Organization

Paper URL

2505.22094

Tags

ReinFlow was jointly proposed in September 2025 by a research team from Carnegie Mellon University, Tsinghua University, and other universities and institutions. The results were published in the paper "ReinFlow: Fine-tuning Flow Matching Policy with Online Reinforcement Learning", which has been selected for NeurIPS 2025.

ReinFlow is the first online reinforcement learning algorithm capable of stably fine-tuning a family of flow matching policies for continuous robot control. Grounded in RL theory, it injects learnable noise into the deterministic path of the flow policy, turning the flow into a discrete-time Markov process whose probabilities can be computed exactly and directly. This transformation facilitates exploration and keeps training stable, allowing ReinFlow to fine-tune a variety of flow model variants, including those with very few or even a single denoising step.
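The noise-injection idea can be illustrated with a minimal sketch. It is not the authors' implementation: `toy_velocity` and `toy_noise_std` are hypothetical stand-ins for the pre-trained velocity network and the learnable noise network, and the Euler discretization with Gaussian noise at each step is an assumption made here for illustration. The point it shows is that once each denoising step becomes a Gaussian transition, the log-probability of the whole chain is a sum of Gaussian log-densities, which a policy-gradient method could then use.

```python
import math
import random


def gaussian_log_prob(x, mean, std):
    """Log-density of a diagonal Gaussian, summed over dimensions."""
    return sum(
        -0.5 * ((xi - mi) / si) ** 2 - math.log(si) - 0.5 * math.log(2.0 * math.pi)
        for xi, mi, si in zip(x, mean, std)
    )


def toy_velocity(action, t, state):
    """Hypothetical velocity field: pulls the action toward the state."""
    return [s - a for a, s in zip(action, state)]


def toy_noise_std(action, t, state):
    """Hypothetical noise network: a constant standard deviation (assumed)."""
    return [0.1 for _ in action]


def sample_noisy_flow(state, num_steps, rng):
    """Roll out the noise-injected flow; return the action and chain log-probability."""
    dt = 1.0 / num_steps
    dim = len(state)
    # Start from a standard normal sample, as in a typical flow policy.
    action = [rng.gauss(0.0, 1.0) for _ in range(dim)]
    log_prob = gaussian_log_prob(action, [0.0] * dim, [1.0] * dim)
    for k in range(num_steps):
        t = k * dt
        velocity = toy_velocity(action, t, state)
        # Deterministic Euler step gives the mean of the transition.
        mean = [a + dt * v for a, v in zip(action, velocity)]
        std = toy_noise_std(action, t, state)
        # Injected noise makes the step a Gaussian Markov transition.
        action = [rng.gauss(m, s) for m, s in zip(mean, std)]
        log_prob += gaussian_log_prob(action, mean, std)
    return action, log_prob


if __name__ == "__main__":
    rng = random.Random(0)
    # A single denoising step (num_steps=1) still yields a tractable log-probability.
    action, log_prob = sample_noisy_flow([0.5, -0.5], num_steps=1, rng=rng)
    print(action, log_prob)
```

In this sketch the log-probability stays tractable regardless of `num_steps`, which is consistent with the claim that fine-tuning remains possible with very few or even one denoising step.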
