Generative replay addition to TD3

PyTorch implementation of Twin Delayed Deep Deterministic Policy Gradients (TD3) with a generative replay component.

The code is heavily modified to work for my research needs

Method is tested on MuJoCo continuous control tasks in OpenAI gym. Networks are trained using PyTorch 1.7 and Python 3.8.

Name		Name	Last commit message	Last commit date
Latest commit History 59 Commits
experimentation/vae		experimentation/vae
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback