Guowei-Zou commited on
Commit
88e75bb
·
verified ·
1 Parent(s): a3e0967

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +125 -0
README.md ADDED
@@ -0,0 +1,125 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ tags:
4
+ - robotics
5
+ - robot-manipulation
6
+ - flow-matching
7
+ - diffusion
8
+ - reinforcement-learning
9
+ - robomimic
10
+ - one-step-inference
11
+ pipeline_tag: robotics
12
+ ---
13
+
14
+ # DM1: MeanFlow with Dispersive Regularization for 1-Step Robotic Manipulation
15
+
16
+ <div align="center">
17
+
18
+ [![Paper](https://img.shields.io/badge/Paper-arXiv-red)](https://arxiv.org/abs/2510.07865)
19
+ [![Code](https://img.shields.io/badge/Code-GitHub-black)](https://github.com/Guowei-Zou/dm1-release)
20
+ [![Project Page](https://img.shields.io/badge/Project-Page-blue)](https://guowei-zou.github.io/dm1/)
21
+
22
+ </div>
23
+
24
+ ## Model Description
25
+
26
+ DM1 is a novel flow matching framework for robotic manipulation that achieves **one-step inference** while maintaining high success rates. The model prevents representation collapse through dispersive regularization while achieving **20-40× faster inference** compared to diffusion baselines.
27
+
28
+ ### Key Features
29
+
30
+ - ⚡ **Single-Step Inference**: 0.07s per timestep (vs. 2-3.5s for diffusion)
31
+ - 🎯 **High Success Rates**: 10-20% improvement over diffusion baselines
32
+ - 🔧 **Dispersive Loss Family**: InfoNCE, Cosine, Hinge regularizers
33
+ - 👁️ **Vision-Ready**: Multi-view RGB observation support
34
+ - 🤖 **Real Robot Validated**: Tested on Franka-Emika-Panda
35
+
36
+ ## Model Variants
37
+
38
+ This repository contains pretrained weights for multiple configurations:
39
+
40
+ | Variant | Description | Best For |
41
+ |---------|-------------|----------|
42
+ | ShortCut + InfoNCE L2 | Flow matching with L2-based dispersive loss | General tasks |
43
+ | ShortCut + InfoNCE Cosine | Flow matching with cosine similarity | Vision tasks |
44
+ | ShortCut + Hinge | Flow matching with hinge loss | Robust control |
45
+ | ShortCut + Covariance | Flow matching with covariance regularization | Feature diversity |
46
+ | MeanFlow variants | Mean flow baseline and dispersive versions | Fast inference |
47
+ | ReFlow variants | Reflow baseline | Iterative refinement |
48
+
49
+ ## Supported Tasks
50
+
51
+ - **Robomimic (RGB)**: lift, can, square, transport
52
+ - **Franka Kitchen**: partial, complete, mixed
53
+ - **D3IL**: avoiding, pushing, sorting
54
+
55
+ ## Quick Start
56
+
57
+ ### Installation
58
+
59
+ ```bash
60
+ git clone https://github.com/Guowei-Zou/dm1-release.git
61
+ cd dm1-release
62
+ conda create -n dm1 python=3.8 -y
63
+ conda activate dm1
64
+ pip install -e .
65
+ ```
66
+
67
+ ### Download Checkpoints
68
+
69
+ ```python
70
+ from huggingface_hub import hf_hub_download
71
+
72
+ # Download specific checkpoint
73
+ checkpoint = hf_hub_download(
74
+ repo_id="zougw2025/dm1-pretrained",
75
+ filename="checkpoints/w_0p1/can/can_w0p1_05_shortcut_infonce_cosine.pt"
76
+ )
77
+ ```
78
+
79
+ ### Evaluation
80
+
81
+ ```bash
82
+ python script/run.py \
83
+ --config-dir=cfg/robomimic/eval/can \
84
+ --config-name=eval_shortcut_mlp_img \
85
+ base_policy_path=checkpoints/w_0p1/can/can_w0p1_05_shortcut_infonce_cosine.pt
86
+ ```
87
+
88
+ ## Performance Metrics
89
+
90
+ | Task | Baseline (32-128 steps) | DM1 (5 steps) | Improvement | Speedup |
91
+ |------|-------------------------|---------------|-------------|---------|
92
+ | Lift | ~85% | 99% | +14% | 20-40× |
93
+ | Can | Variable | High | +10-20% | 20-40× |
94
+ | Square | Moderate | Improved | +15-25% | 20-40× |
95
+ | Transport | Low | High | +20-30% | 20-40× |
96
+
97
+ ## Citation
98
+
99
+ If you use DM1 in your research, please cite:
100
+
101
+ ```bibtex
102
+ @misc{zou2025dm1meanflowdispersiveregularization,
103
+ title={DM1: MeanFlow with Dispersive Regularization for 1-Step Robotic Manipulation},
104
+ author={Guowei Zou and Haitao Wang and Hejun Wu and Yukun Qian and Yuhang Wang and Weibing Li},
105
+ year={2025},
106
+ eprint={2510.07865},
107
+ archivePrefix={arXiv},
108
+ primaryClass={cs.RO},
109
+ url={https://arxiv.org/abs/2510.07865},
110
+ }
111
+ ```
112
+
113
+ ## License
114
+
115
+ This project is licensed under the Apache License 2.0.
116
+
117
+ ## Acknowledgments
118
+
119
+ DM1 builds upon prior work including Diffusion Policy, ReinFlow, MeanFlow, FlowPolicy, D2PPO, and π0.5.
120
+
121
+ ## Contact
122
+
123
+ - **Author**: Guowei Zou
124
+ - **GitHub**: [dm1-release](https://github.com/Guowei-Zou/dm1-release)
125
+ - **Project Page**: [https://guowei-zou.github.io/dm1/](https://guowei-zou.github.io/dm1/)