Temporal-Difference-Models-with-Value-Function-Based-sampling

Keras implementation of Temporal Difference Models by V. Pong et al.(2018) + Value based sampling

Requirements:

The structure of this code is built on Keras-rl

A few tweaks that we did -

Relabelling goals based on expected reward henceforth, with some probability. We found that it lead to faster convergence in the FetchReach environment.
Decayed the 'goal reached' condition radius gradually. It lead to faster convergence as well

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
agents		agents
Final_report.pdf		Final_report.pdf
README.md		README.md
__init__.py		__init__.py
callbacks.py		callbacks.py
core.py		core.py
ddpg.py		ddpg.py
ddpg_mujoco.py		ddpg_mujoco.py
memory.py		memory.py
policy.py		policy.py
processors.py		processors.py
random_rl.py		random_rl.py
util.py		util.py

Provide feedback