Q-learning is one of the early breakthroughs in reinforcement learning, developed by Chris Watkins in 1989 while he was a graduate student at Cambridge University. The algorithm is simple and efficient, and it remains one of the most popular reinforcement learning algorithms today. This report presents an implementation of Q-learning on a navigation task and studies the impact of its main parameters (e.g. learning rate, discount rate, exploration factor, and exploration decay); a grid search is performed to find the best-performing combination. Furthermore, Double Q-learning, which was introduced to address the overestimation bias of Q-learning, is implemented to examine how it differs from standard Q-learning. In addition, the parallels between Q-learning and psychological learning theories are discussed.
Authors: Harry Li, Xin Li
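To make the parameters named above concrete, the following is a minimal sketch of the tabular Q-learning and Double Q-learning updates, assuming a generic grid-world navigation task; the state/action sizes and parameter values (alpha, gamma, epsilon) are placeholders, not the settings used in this report.

```python
import numpy as np

# Assumed placeholder environment size: e.g. a 5x5 grid with 4 moves.
n_states, n_actions = 25, 4
Q = np.zeros((n_states, n_actions))

alpha   = 0.1   # learning rate
gamma   = 0.9   # discount rate
epsilon = 0.1   # exploration factor (epsilon-greedy)

rng = np.random.default_rng(0)

def epsilon_greedy(state):
    """Pick a random action with probability epsilon, otherwise the greedy action."""
    if rng.random() < epsilon:
        return int(rng.integers(n_actions))
    return int(np.argmax(Q[state]))

def q_update(state, action, reward, next_state, done):
    """Q-learning: move Q(s, a) toward r + gamma * max_a' Q(s', a')."""
    target = reward if done else reward + gamma * np.max(Q[next_state])
    Q[state, action] += alpha * (target - Q[state, action])

# Double Q-learning keeps two tables; one selects the greedy action and the
# other evaluates it, which reduces the overestimation bias of a single table.
QA = np.zeros((n_states, n_actions))
QB = np.zeros((n_states, n_actions))

def double_q_update(state, action, reward, next_state, done):
    """Double Q-learning: randomly update one table using the other's value estimate."""
    if rng.random() < 0.5:
        best = int(np.argmax(QA[next_state]))
        target = reward if done else reward + gamma * QB[next_state, best]
        QA[state, action] += alpha * (target - QA[state, action])
    else:
        best = int(np.argmax(QB[next_state]))
        target = reward if done else reward + gamma * QA[next_state, best]
        QB[state, action] += alpha * (target - QB[state, action])
```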