Skip to content
VinF edited this page Mar 9, 2016 · 17 revisions

#General_Deep_Q_RL

Introduction

This package provides a general deep Q-learning algorithm. It is based on the original deep Q learning algorithm described in : Mnih, Volodymyr, et al. "Human-level control through deep reinforcement learning." Nature 518.7540 (2015): 529-533.

Contrary to the original code, this package provides a more general framework where observations are made up of any number of elements : scalars, vectors and frames (instead of one type of frame only in the above mentionned paper). At each time step, an action is taken by the agent and one observation is gathered. The belief state on which the agent is based to build the Q function is made up of any length history of each element provided in the observation.

The code is based on Python (compatible with python 2.7 or python 3). The neural network architectures uses Theano (with or without the lasagne library). However, the framework is made in such a way that it is very easy to use your favorite python-based framework to code your own neural network architecture.