tensorpack/examples/DeepQNetwork at master · pjh4993/tensorpack

History

Name		Name	Last commit message	Last commit date
parent directory ..
DQN.py		DQN.py
DQNModel.py		DQNModel.py
README.md		README.md
atari.py		atari.py
atari_wrapper.py		atari_wrapper.py
breakout.jpg		breakout.jpg
common.py		common.py
curve-breakout.png		curve-breakout.png
expreplay.py		expreplay.py

README.md

video demo

Reproduce (performance of) the following reinforcement learning methods:

Nature-DQN in: Human-level Control Through Deep Reinforcement Learning
Double-DQN in: Deep Reinforcement Learning with Double Q-learning
Dueling-DQN in: Dueling Network Architectures for Deep Reinforcement Learning
A3C in Asynchronous Methods for Deep Reinforcement Learning. (I used a modified version where each batch contains transitions from different simulators, which I called "Batch-A3C".)

Usage:

Install dependencies by pip install 'gym[atari]'.

With ALE (paper's setting):

Download an atari rom, e.g.:

wget https://github.com/openai/atari-py/raw/gdb/atari_py/atari_roms/breakout.bin

Start Training:

./DQN.py --env breakout.bin
# use `--algo` to select other DQN algorithms. See `-h` for more options.

Watch the agent play:

# Download pretrained models or use one you trained:
wget http://models.tensorpack.com/DeepQNetwork/DoubleDQN-breakout.bin.npz
./DQN.py --env breakout.bin --task play --load DoubleDQN-breakout.bin.npz

Evaluation of 50 episodes:

./DQN.py --env breakout.bin --task eval --load DoubleDQN-breakout.bin.npz

With gym's Atari:

Install gym and atari_py. Use --env BreakoutDeterministic-v4 instead of the ROM file.

Performance

Claimed performance in the paper can be reproduced, on several games I've tested with.

Environment	Avg Score	Download
breakout.bin	465	⬇️
seaquest.bin	8686	⬇️
ms_pacman.bin	3323	⬇️
beam_rider.bin	15835	⬇️

Speed

On one GTX 1080Ti, the ALE version took ~2 hours of training to reach 21 (maximum) score on Pong, ~10 hours of training to reach 400 score on Breakout. It runs at 100 batches (6.4k trained frames, 400 seen frames, 1.6k game frames) per second on GTX 1080Ti. This is likely the fastest open source TF implementation of DQN.

A3C code and models for Atari games in OpenAI Gym are released in examples/A3C-Gym

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DeepQNetwork

DeepQNetwork

README.md

Usage:

With ALE (paper's setting):

With gym's Atari:

Performance

Speed

Files

DeepQNetwork

Directory actions

More options

Directory actions

More options

Latest commit

History

DeepQNetwork

Folders and files

parent directory

README.md

Usage:

With ALE (paper's setting):

With gym's Atari:

Performance

Speed