An asynchronous implementation of DQN that utilises multi-core CPU and multi-GPU architectures.
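The rough idea is the standard asynchronous actor/learner split: several CPU worker processes step their own environment copies and stream transitions to a learner that performs the DQN updates. Below is a minimal sketch of that pattern in PyTorch; it is illustrative only and does not use this repository's classes. The CartPole environment, the network shape, and the queue-based hand-off are all assumptions.

```python
# Illustrative sketch of the asynchronous actor/learner pattern;
# names and structure here are assumptions, not this repo's actual API.
import gym
import torch
import torch.multiprocessing as mp
import torch.nn as nn


def actor(worker_id, shared_net, queue, n_steps=1000):
    """Each worker owns its own environment copy and streams
    transitions to the learner through a shared queue."""
    env = gym.make("CartPole-v1")
    state = env.reset()  # older gym (<0.26) API: reset() returns the observation
    for _ in range(n_steps):
        with torch.no_grad():
            q_values = shared_net(torch.as_tensor(state, dtype=torch.float32))
        action = int(q_values.argmax())
        next_state, reward, done, _ = env.step(action)
        queue.put((state, action, reward, next_state, done))
        state = env.reset() if done else next_state


if __name__ == "__main__":
    # Hypothetical network shape for CartPole (4 observations, 2 actions).
    net = nn.Sequential(nn.Linear(4, 64), nn.ReLU(), nn.Linear(64, 2))
    net.share_memory()  # place parameters in shared memory so workers see updates
    queue = mp.Queue(maxsize=10_000)
    workers = [mp.Process(target=actor, args=(i, net, queue)) for i in range(4)]
    for w in workers:
        w.start()
    # A learner process would drain `queue` into a replay buffer here and run
    # DQN updates (with a periodically synced target network) on the GPU.
    for w in workers:
        w.join()
```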
- Python 3
$ python3 -m venv .env
$ source .env/bin/activate
$ pip install torch torchvision gym

$ source .env/bin/activate
$ python trainer.py --<arg> <value>

| Argument | Description | Values | Default |
|---|---|---|---|
| --environment | Environment to use for training | string | cartpole |
| --save_model | Path to save the model | string | '' |
| --load_model | Path to load a saved model | string | '' |
| --n_workers | Number of workers to use for training | int | 1 |
| --target_update_frequency | Sync frequency for target network | int | 10 |
| --checkpoint_frequency | Frequency for creating checkpoints | int | 10 |
| --lr | Learning rate for training | float | 5e-4 |
| --batch_size | Batch size for training | int | 32 |
| --gamma | Discount factor for training | float < 1.0 | 0.99 |
| --eps | Epsilon for the ε-greedy exploration policy | float < 1.0 | 0.999 |
| --min_eps | Minimum value for epsilon | float | 0.1 |
| --buffer_size | Replay buffer size (transitions) | int | 100000 |
| --max_grad_norm | Maximum L2 norm for gradient clipping | float | 10 |
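For example, to train on CartPole with four workers and save the model (the checkpoint path here is illustrative):

$ python trainer.py --environment cartpole --n_workers 4 --save_model checkpoints/cartpole.pt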
$ source .env/bin/activate
$ python tester.py --<arg> <value>

| Argument | Description | Values | Default |
|---|---|---|---|
| --environment | Environment to use for testing | string | cartpole |
| --load_model | Path to load a saved model | string | '' |
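For example, to evaluate a saved model (the checkpoint path here is illustrative):

$ python tester.py --environment cartpole --load_model checkpoints/cartpole.pt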
To use a custom simulator, implement the abstract class BaseSimulator. Then implement the QNetwork model by extending BaseModel. Finally, register the simulator and the model in environments.py. A sketch of these three steps follows.
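A minimal sketch of those steps is below. The method names (reset/step/forward), the constructor signatures, the module paths, and the ENVIRONMENTS registry are assumptions for illustration, not the repository's actual API.

```python
# Hypothetical sketch -- method names, module paths, and the registry
# below are assumed, not taken from this repo's BaseSimulator/BaseModel API.
import torch
import torch.nn as nn

from simulator import BaseSimulator  # assumed module layout
from model import BaseModel          # assumed module layout


class GridWorldSimulator(BaseSimulator):
    """Toy 1-D grid: walk left/right until reaching the right edge."""

    def __init__(self, size=8):
        self.size, self.pos = size, 0

    def reset(self):
        self.pos = 0
        return torch.tensor([self.pos], dtype=torch.float32)

    def step(self, action):
        self.pos = max(0, min(self.size - 1, self.pos + (1 if action == 1 else -1)))
        done = self.pos == self.size - 1
        reward = 1.0 if done else -0.01
        return torch.tensor([self.pos], dtype=torch.float32), reward, done


class GridWorldQNetwork(BaseModel):
    """Q-network mapping the 1-D position to Q-values for the 2 actions."""

    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(1, 32), nn.ReLU(), nn.Linear(32, 2))

    def forward(self, state):
        return self.net(state)


# In environments.py -- assumed registry shape mapping the --environment
# flag value to the (simulator, model) pair:
ENVIRONMENTS = {
    "gridworld": (GridWorldSimulator, GridWorldQNetwork),
}
```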
V. Mnih et al., “Playing Atari with Deep Reinforcement Learning,” arXiv:1312.5602 [cs], Dec. 2013.