diff --git a/README.md b/README.md
index 0fcb5dd..6c49ec3 100644
--- a/README.md
+++ b/README.md
@@ -1 +1,66 @@
-# Coming Soon...
+

WAM-Diff: A Masked Diffusion VLA Framework with MoE and Online Reinforcement Learning for Autonomous Driving

+
+ Mingwang Xu1
+ Jiahao Cui1*
+ Feipeng Cai2*
+ Hanlin Shang1*
+ Zhihao Zhu1
+ Shan Luan1
+
+
+ Yifang Xu1*
+ Neng Zhang2
+ Yaoyi Li2
+ Jia Cai2
+ Siyu Zhu1
+
+
+
+ 1Fudan University  2Yinwang Intelligent Technology Co., Ltd
+
+
+## 🔧️ Framework
+![framework](assets/main_arch.png)
+
+## 📅️ Roadmap
+
+| Status | Milestone | ETA |
+| :----: | :-------: | :--------: |
+| 🚀 | **[Releasing the inference source code](https://github.com/fudan-generative-vision/WAM-Diff)** | 2025.12.21 |
+| 🚀 | **[Pretrained models on Hugging Face](https://huggingface.co/fudan-generative-ai/WAM-Diff)** | TBD |
+| 🚀 | **[Releasing the training scripts](#training)** | TBD |
+
+
+### Quick Inference Demo
+The [WAM-Diff](TBD) model is now available on the Hugging Face Hub. To test it quickly, follow these steps:
+
+1. **Clone the repository**
+   ```bash
+   git clone https://github.com/fudan-generative-vision/WAM-Diff
+   cd WAM-Diff
+   ```
+2. **Initialize the environment**
+   Run the environment setup script to install the necessary dependencies:
+   ```bash
+   bash init_env.sh
+   ```
+3. **Prepare the models**
+   Download the pretrained WAM-Diff model from Hugging Face to the `./model/WAM-Diff` directory:
+   ```
+   https://huggingface.co/fudan-generative-ai/WAM-Diff
+   ```
+   Download the pretrained SigLIP 2 model from Hugging Face to the `./model/siglip2-so400m-patch14-384` directory:
+   ```
+   https://huggingface.co/google/siglip2-so400m-patch14-384
+   ```
+
+
+4. **Run the demo script**
+   Execute the demo script to test WAM-Diff on an example image:
+   ```bash
+   bash ./train/inf.sh
+   ```
+
+
+## 🤗 Acknowledgements
+We gratefully acknowledge the contributors to the [LLaDA-V](https://github.com/ML-GSAI/LLaDA-V) repository, whose commitment to open source has provided us with an excellent codebase and pretrained models.
\ No newline at end of file
diff --git a/assets/main_arch.png b/assets/main_arch.png
new file mode 100644
index 0000000..2b3f8e4
Binary files /dev/null and b/assets/main_arch.png differ
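
The README's model preparation step lists two Hugging Face repositories and their target directories but leaves the download itself to the reader. One possible way to script it is sketched below. It is not part of this PR. It assumes the `huggingface_hub` package is installed and that both repositories are publicly downloadable; the helper names `missing_models` and `download_missing` are hypothetical. Only the repo IDs and directory names come from the README.

```python
from pathlib import Path

# Repo IDs and target directories as listed in the README's model preparation step.
MODELS = {
    "fudan-generative-ai/WAM-Diff": Path("model/WAM-Diff"),
    "google/siglip2-so400m-patch14-384": Path("model/siglip2-so400m-patch14-384"),
}


def missing_models(root: Path = Path(".")) -> list[str]:
    """Return the repo IDs whose target directory is absent or empty under root.

    Hypothetical helper: it only inspects the local filesystem.
    """
    missing = []
    for repo_id, rel_dir in MODELS.items():
        target = root / rel_dir
        if not target.is_dir() or not any(target.iterdir()):
            missing.append(repo_id)
    return missing


def download_missing(root: Path = Path(".")) -> None:
    """Download every model that missing_models() reports (hypothetical helper)."""
    # Imported lazily so the local check works without huggingface_hub installed.
    from huggingface_hub import snapshot_download

    for repo_id in missing_models(root):
        # snapshot_download fetches the whole repository into local_dir.
        snapshot_download(repo_id=repo_id, local_dir=str(root / MODELS[repo_id]))
```

Calling `download_missing()` from the repository root before `bash ./train/inf.sh` would fetch only the checkpoints that are not already in place, so re-running it after a complete download does nothing.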