GitHub - clay-curry/flapPy-RL: an RL algorithm solving Flappy Bird. by setting returns R to be the number obstacles cleared upon crashing, q* : S × A → ℝ generates the expectation E(R) from the state-action pair (s, a). experiments support the conjecture that a tabular, n-step Sarsa algorithm converges to a policy π clearing arbitrarily many obstacles (confirmed up to 1,000,000)

clay-curry / flapPy-RL Public

an RL algorithm solving Flappy Bird. by setting returns R to be the number obstacles cleared upon crashing, q* : S × A → ℝ generates the expectation E(R) from the state-action pair (s, a). experiments support the conjecture that a tabular, n-step Sarsa algorithm converges to a policy π clearing arbitrarily many obstacles (confirmed up to 1,000,000)

1 star 0 forks Branches Tags Activity

Star

Branches Tags

Name		Name	Last commit message	Last commit date
Latest commit History 100 Commits
assets		assets
data		data
.gitignore		.gitignore
config.py		config.py
flappy.py		flappy.py
n_sarsa.py		n_sarsa.py
q_agent.py		q_agent.py
q_agent_flappy.py		q_agent_flappy.py