Keras implementation of Temporal Difference Model by V. Pong et al. (2018) with some tweaks

satyenrajpal/Temporal-Difference-Models-with-Value-Function-Based-sampling

Temporal-Difference-Models-with-Value-Function-Based-sampling

Keras implementation of Temporal Difference Models by V. Pong et al. (2018), extended with value-function-based sampling of goals.

Requirements:

  • Keras
  • TensorFlow or Theano as the Keras backend
  • Gym, with the Robotics environments and MuJoCo installed

The structure of this code is based on keras-rl.

A few tweaks that we made:

  1. With some probability, goals are relabelled based on the expected future reward (the value function). We found that this led to faster convergence in the FetchReach environment.
  2. The radius of the 'goal reached' condition is decayed gradually during training. This led to faster convergence as well.
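
The two tweaks above can be illustrated with a minimal sketch. This is not the code from this repository: the function names, the softmax weighting over candidate goals, the linear decay schedule, and all default values are assumptions made for illustration. `value_fn` is a hypothetical callable that returns the estimated value of a goal from a given state.

```python
import numpy as np


def relabel_goal(candidate_goals, value_fn, state, original_goal,
                 p_relabel=0.5, rng=None):
    """Value-based goal relabelling (illustrative sketch, not the repo's code).

    With probability p_relabel, the original goal is replaced by one of the
    candidate goals, sampled with softmax weights over value_fn(state, goal).
    Otherwise the original goal is returned unchanged.
    """
    if rng is None:
        rng = np.random.default_rng()
    if len(candidate_goals) == 0 or rng.random() >= p_relabel:
        return original_goal
    values = np.array([value_fn(state, g) for g in candidate_goals], dtype=float)
    # Softmax over values (assumed weighting); subtract the max for stability.
    weights = np.exp(values - values.max())
    probs = weights / weights.sum()
    idx = rng.choice(len(candidate_goals), p=probs)
    return candidate_goals[idx]


def goal_radius(step, start_radius=0.2, end_radius=0.05, decay_steps=10000):
    """Linearly decay the 'goal reached' radius (assumed schedule and values)."""
    frac = min(max(step / decay_steps, 0.0), 1.0)
    return start_radius + frac * (end_radius - start_radius)


def goal_reached(achieved, goal, radius):
    """True if the achieved position lies within `radius` of the goal."""
    diff = np.asarray(achieved, dtype=float) - np.asarray(goal, dtype=float)
    return float(np.linalg.norm(diff)) <= radius
```

In a training loop, `goal_radius(step)` would be evaluated once per step and passed to `goal_reached`, so that early episodes succeed with a loose tolerance and later episodes need a tighter one. `relabel_goal` would be called when sampling transitions from the replay buffer, with candidate goals taken, for example, from states visited later in the same episode.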
