This repository includes the code to replicate the USAC algorithm from paper:
Improving Actor-Critic Training with Steerable Action-Value Approximation Errors
Bahareh Tasdighi, Nicklas Werge, Yi-Shan Wu, Melih Kandemir, 2025
European Conference on Artificial Intelligence
ArXiv
To train the usac on cartpole swingup environment run the following command:
python main.py
If you use USAC, please cite: