Skip to content

KWYi/Deep-Q-Learning-pytorch

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

23 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Reinforcement-learning

This is a PyTorch implementation of Deep Q-Learning.

Example (1)- A result from the Atari_Breakout.

Experience memory capacity: 80000 set
Random action ratio: from 1.0 to 0.1 through 1000000 frames

Best eposide:

54_8520.mp4

Learning scores:

Blue line: Score of training episode.
Magenta line: Mean score of last 100 training episodes.
Red star: Score of target agent.

Example (2)- A result from the Cartpole.

Experience memory capacity: 10000 set
Random action ratio: from 0.9 to 0.05 through 200 episodes

Best eposide:

221_172.mp4

Learning scores:

Blue line: Score of training episode.
Magenta line: Mean score of last 100 training episodes.

About

PyTorch implementation of Deep Q-Learning.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published