Reinforcement_learning

This is an implementation of a small find the park route game using reinforcement learning.

overview

A 4x4 maze is used for this perticular game. It follows a reward system for calculation of best path to reach to park. Initially, it uses exploration policy to find better rewards and on finding positive rewards it tries to exploit it for the maximaiztion of total rewards. It uses transition state matix and action matrix for calculation of Q table. On multiple iterations, the weights of Q table adjusted according to the state matrix and action matrix. On the basis of Q table values, Best path has been choosen for the agent.

Functioning

A 4x4 map was build with a total of 16 cells.
Reward_matrix was designed on the basis of possible routes having different obstacles(dogs= -10) and extra rewards(candies= +50).
Transition state matrix is designed on the basis of possible transition from one cell to next cell in any direction. Transition_state_matrix lays out all the possible movements to another state given the current state. current state here refers to the present cell where agent is at that moment.
Action Matrix is designed on the basis of no. of possible action for agent in any cell. Here, the action refers to possible moves in all directions. Five cases were considered for each movement. These movements are Up,Down,Left,Right and Neutral(staying in same cell)
Q table is calculated with tha help of Bellman's equation. All the values of Q table has been calculated using transistion state matrix and action matrix.
Best route is calculated from the values in Q table inorder to maximize the reward.

Reference

https://www.freecodecamp.org/news/an-introduction-to-q-learning-reinforcement-learning-14ac0b4493cc/

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
README.md		README.md
Treasurehunt.ipynb		Treasurehunt.ipynb
parkgame.png		parkgame.png
parkgameresult.png		parkgameresult.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Reinforcement_learning

overview

Functioning

Reference

About

Releases

Packages

Languages

jaypal1-7/Reinforcement_learning

Folders and files

Latest commit

History

Repository files navigation

Reinforcement_learning

overview

Functioning

Reference

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages