Skip to content

Jianshu-Hu/action-persistence

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

55 Commits
 
 
 
 
 
 

Repository files navigation

State-Novelty Guided Action Persistence in Deep Reinforcement Learning

This is an original PyTorch implementation of incorporating temporally persistent exploration in DrQ-v2 from

State-Novelty Guided Action Persistence in Deep Reinforcement Learning by Jianshu Hu, Paul Weng and Yutong Ban.

Method

We implement State-Novelty guided adaptive Action Persistence (SNAP) based on DrQv2.

Instructions

Install MuJoCo if it is not already the case:

  • Obtain a license on the MuJoCo website.
  • Download MuJoCo binaries here.
  • Unzip the downloaded archive into ~/.mujoco/mujoco200 and place your license key file mjkey.txt at ~/.mujoco.
  • Use the env variables MUJOCO_PY_MJKEY_PATH and MUJOCO_PY_MUJOCO_PATH to specify the MuJoCo license key path and the MuJoCo directory path.
  • Append the MuJoCo subdirectory bin path into the env variable LD_LIBRARY_PATH.

Install the following libraries:

sudo apt update
sudo apt install libosmesa6-dev libgl1-mesa-glx libglfw3

Install dependencies:

conda env create -f conda_env.yml
conda activate drqv2

Train the agent with original DrQv2:

python train.py task=quadruped_walk

Train the agent with SNAP:

python train.py task=quadruped_walk repeat_type=1 action_repeat=1 update_every_steps=4 nstep=6

Monitor results:

tensorboard --logdir exp_local

License

The majority of this code is licensed under the MIT license, however portions of the project are available under separate license terms: DeepMind is licensed under the Apache 2.0 license.

About

Explore temporal invariance/equivariance in DRL

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published