Codes accompanying the paper "Score Regularized Policy Optimization through Diffusion Behavior" (ICLR 2024).
-
Updated
Feb 10, 2024 - Python
Codes accompanying the paper "Score Regularized Policy Optimization through Diffusion Behavior" (ICLR 2024).
🌟 Align diffusion processes with detailed human preferences to improve machine learning models for richer, more accurate outputs.
Add a description, image, and links to the behavior-regularization topic page so that developers can more easily learn about it.
To associate your repository with the behavior-regularization topic, visit your repo's landing page and select "manage topics."