$FlowRAM$: Grounding Flow Matching Policy with Region-Aware Mamba Framework for Robotic Manipulation

Sen Wang^1*, Le Wang^1†, Sanping Zhou¹, Jingyi Tian¹, Jiayi Li¹, Haowen Sun¹, Wei Tang²

¹National Key Laboratory of Human-Machine Hybrid Augmented Intelligence, Xi’an Jiaotong University

²University of Illinois at Chicago

Project Page | ArXiv | Blog (In Chinese)

Abstract

Robotic manipulation in high-precision tasks is essential for numerous industrial and real-world applications where accuracy and speed are required. Yet current diffusion-based policy learning methods generally suffer from low computational efficiency due to the iterative denoising process during inference. Moreover, these methods do not fully explore the potential of generative models for enhancing information exploration in 3D environments. In response, we propose FlowRAM, a novel framework that leverages generative models to achieve region-aware perception, enabling efficient multimodal information processing. Specifically, we devise a Dynamic Radius Schedule, which allows adaptive perception, facilitating transitions from global scene comprehension to fine-grained geometric details. Furthermore, we integrate state space models to integrate multimodal information, while preserving linear computational complexity. In addition, we employ conditional flow matching to learn action poses by regressing deterministic vector fields, simplifying the learning process while maintaining performance. We verify the effectiveness of the FlowRAM in the RLBench, an established manipulation benchmark, and achieve state-of-the-art performance. The results demonstrate that FlowRAM achieves a remarkable improvement, particularly in high-precision tasks, where it outperforms previous methods by 12.0% in average success rate. Additionally, FlowRAM is able to generate physically plausible actions for a variety of real-world tasks in less than 4 time steps, significantly increasing inference speed.

💻 Installation

See install.md for installation instructions.

📚 Data

FlowRAM leverages the RLBench framework to generate expert demonstrations, including precision-focused tasks for high-accuracy manipulation. Generated data is saved in:

$YOUR_REPO_PATH/FlowRAM/data/

We follow RLBench’s data generation pipeline for consistency and scalability.

🛠️ Usage

Scripts for training and evaluation are included in the scripts/ & online_evaluation_rlbench/ directory.

Train FlowRAM in GNFactor setup:
```
bash scripts/gnfactor_train.sh
```
Train FlowRAM in Precise setup:
```
bash scripts/precise_train.sh
```

Evaluate a policy:

bash online_evaluation_rlbench\eval_peract.sh

🤖 Real-world Deployments

FlowRAM supports deployment on a 6-DoF UR5 arm with Robotiq gripper, achieving robust manipulation across six real-world tasks.

🚧 TODO

📝 Formatting code for release
📦 Open-sourcing pretrained weights
⏳ Currently working on other projects, will release when time permits.

🏷️ License

This repository is licensed under the MIT License.

🙏 Acknowledgements

Our work builds on 3D Diffuser Actor, PointMamba, and Mamba. We thank these projects for their inspiring contributions.

👍 Citation

@inproceedings{wang2025flowram,
  title={FlowRAM: Grounding Flow Matching Policy with Region-Aware Mamba Framework for Robotic Manipulation},
  author={Wang, Sen and Wang, Le and Zhou, Sanping and Tian, Jingyi and Li, Jiayi and Sun, Haowen and Tang, Wei},
  booktitle={Proceedings of the Computer Vision and Pattern Recognition Conference},
  pages={12176--12186},
  year={2025}
}

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
asserts		asserts
data_preprocessing		data_preprocessing
datasets		datasets
diffuser_actor		diffuser_actor
online_evaluation_rlbench		online_evaluation_rlbench
scripts		scripts
tasks		tasks
utils		utils
KNN_CUDA-0.2-py3-none-any.whl		KNN_CUDA-0.2-py3-none-any.whl
LICENSE		LICENSE
README.md		README.md
causal_conv1d-1.0.0+cu118torch1.13cxx11abiFALSE-cp39-cp39-linux_x86_64.whl		causal_conv1d-1.0.0+cu118torch1.13cxx11abiFALSE-cp39-cp39-linux_x86_64.whl
engine.py		engine.py
environment.yaml		environment.yaml
evaluate_policy.py		evaluate_policy.py
insatll.md		insatll.md
main_trajectory.py		main_trajectory.py
switch-cuda.sh		switch-cuda.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

$FlowRAM$: Grounding Flow Matching Policy with Region-Aware Mamba Framework for Robotic Manipulation

Sen Wang^1*, Le Wang^1†, Sanping Zhou¹, Jingyi Tian¹, Jiayi Li¹, Haowen Sun¹, Wei Tang²

¹National Key Laboratory of Human-Machine Hybrid Augmented Intelligence, Xi’an Jiaotong University

²University of Illinois at Chicago

Project Page | ArXiv | Blog (In Chinese)

Abstract

💻 Installation

📚 Data

🛠️ Usage

🤖 Real-world Deployments

🚧 TODO

🏷️ License

🙏 Acknowledgements

👍 Citation

About

Uh oh!

Releases

Packages

Languages

License

SanMumumu/FlowRAM

Folders and files

Latest commit

History

Repository files navigation

$FlowRAM$: Grounding Flow Matching Policy with Region-Aware Mamba Framework for Robotic Manipulation

Sen Wang1*, Le Wang1†, Sanping Zhou1, Jingyi Tian1, Jiayi Li1, Haowen Sun1, Wei Tang2

1National Key Laboratory of Human-Machine Hybrid Augmented Intelligence, Xi’an Jiaotong University

2University of Illinois at Chicago

Project Page | ArXiv | Blog (In Chinese)

Abstract

💻 Installation

📚 Data

🛠️ Usage

🤖 Real-world Deployments

🚧 TODO

🏷️ License

🙏 Acknowledgements

👍 Citation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Sen Wang^1*, Le Wang^1†, Sanping Zhou¹, Jingyi Tian¹, Jiayi Li¹, Haowen Sun¹, Wei Tang²

¹National Key Laboratory of Human-Machine Hybrid Augmented Intelligence, Xi’an Jiaotong University

²University of Illinois at Chicago

Packages