UtilitySoftActorCritic

This repository contains the code to reproduce the USAC algorithm from the paper:

Improving Actor-Critic Training with Steerable Action-Value Approximation Errors
Bahareh Tasdighi, Nicklas Werge, Yi-Shan Wu, Melih Kandemir, 2025
European Conference on Artificial Intelligence
ArXiv

USAC

To train USAC on the cartpole swingup environment, run the following command:

python main.py

Cite as

If you use USAC, please cite:

