Skip to content

jhz1192/VR-DP3

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Information Filtering via Variational Regularization for Robot Manipulation

Paper License


📋 Overview

Variational Regularization Module
Variational Regularization Module

Our proposed method, Variational Regularization (VR), adaptively filters out noise and redundant information from features. left: Our method, built on DP3, introduces a Variational Regularization module immediately after the last downsampling features in the U-Net decoder, where noise is most likely to accumulate, enabling effective information filtering. right Architecture of the Variational Regularization module: it modulates the features conditioned on the diffusion timestep, then predicts the feature-wise mean and standard deviation, and uses the reparameterization trick to obtain the filtered features.

⚙️ Installation

Our code is intended to be evaluated with the third-party benchmark RoboTwin2.0. To reproduce results, you need to copy our policy folder into the RoboTwin repository and run RoboTwin’s training and evaluation scripts.

Clone our repository along with RoboTwin:

git clone --recurse-submodules https://github.com/jhz1192/VR-DP3.git

Set up the runtime environment using the scripts provided by RoboTwin official doc:

conda create -n vr_dp3 python=3.10 -y
conda activate vr_dp3
cd RoboTwin
bash script/_install.sh
pip install zarr==2.12.0 wandb ipdb gpustat dm_control omegaconf hydra-core==1.2.0 dill==0.3.5.1 einops==0.4.1 diffusers==0.11.1 numba==0.56.4 moviepy imageio av matplotlib termcolor

Note that PyTorch3D must be installed successfull. For more detailed installation instructions, please refer to the official doc.

Download assets:

bash script/_download_assets.sh

After setting up the environment, copy the VRDP3 folder into RoboTwin’s policy/ directory:

cd ..
cp -r VRDP3/ RoboTwin/policy/

Install VRDP3 by running:

cd RoboTwin/policy/VRDP3/3D-Diffusion-Policy
pip install -e .
cd ../../..

🚀 Usage

Collect Data

We use the data collection scripts provided by RoboTwin:

bash collect_data.sh ${task_name} ${task_config} ${gpu_id}
# Example: bash collect_data.sh beat_block_hammer demo_clean 0

Note: Please set datatype.pointcloud to true in task_config/demo_clean.yml.

Training and Evaluation

  1. Prepare Training Data:
cd policy/PocketDP3
bash process_data.sh ${task_name} ${task_config} ${expert_data_num}
# Example: bash process_data.sh beat_block_hammer demo_clean 50
  1. Training VR-DP3 by:
bash train.sh ${task_name} ${task_config} ${expert_data_num} ${seed} ${gpu_id}
# bash train.sh beat_block_hammer demo_clean 50 0 0
  1. Evaluate VR-DP3 by:
bash eval.sh ${task_name} ${task_config} ${ckpt_setting} ${expert_data_num} ${seed} ${gpu_id}
# bash eval.sh beat_block_hammer demo_clean demo_clean 50 0 0

All training and evaluation follow the official DP3 scripts provided by RoboTwin. If you encounter any issues, please refer to the official RoboTwin DP3 documentation.

💡Tips: If you need to modify KL-weight $\beta$, please edit it in VRDP3/3D-Diffusion-Policy/diffusion_policy_3d/policy/dp3_vr.py.

📈 Results

Performance on RoboTwin 2.0 Benchmark

Evaluation on the RoboTwin 2.0 benchmark.

Task DP3 DP3 + VR Improvement
Handover Block 70.0% 91.0% +21.0%
Move Can Pot 70.0% 90.0% +20.0%
Open Microwave 61.0% 88.0% +21.0%
Place Cans Plasticbox 48.0% 70.0% +22.0%
Stamp Seal 18.0% 36.0% +18.0%
... ... ... ...
Average (49 tasks) 56.3% 62.4% +6.1%

🎓 Citation

If you find VR-DP3 useful for your research, please kindly cite our paper:

@article{zhang2026information,
  title={Information Filtering via Variational Regularization for Robot Manipulation},
  author={Zhang, Jinhao and Xia, Wenlong and Wang, Yaojia and Zhou, Zhexuan and Li, Huizhe and Lai, Yichen and Song, Haoming and Gong, Youmin and Me, Jie},
  journal={arXiv preprint arXiv:2601.21926},
  year={2026}
}

📄 License

This project is licensed under the Apache License 2.0 - see the LICENSE file for details.

🌟 Acknowledgments

  • Robotwin for the simulation bechmark
  • DP3 for the strong baseline work

✉️ Contact

If you have any questions, please contact us at:

About

Official Implementation of "Information Filtering via Variational Regularization for Robot Manipulation"

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors