State-Novelty Guided Action Persistence in Deep Reinforcement Learning

This is an original PyTorch implementation of incorporating temporally persistent exploration in DrQ-v2 from

State-Novelty Guided Action Persistence in Deep Reinforcement Learning by Jianshu Hu, Paul Weng and Yutong Ban.

Method

We implement State-Novelty guided adaptive Action Persistence (SNAP) based on DrQv2.

Instructions

Install MuJoCo if it is not already the case:

Obtain a license on the MuJoCo website.
Download MuJoCo binaries here.
Unzip the downloaded archive into ~/.mujoco/mujoco200 and place your license key file mjkey.txt at ~/.mujoco.
Use the env variables MUJOCO_PY_MJKEY_PATH and MUJOCO_PY_MUJOCO_PATH to specify the MuJoCo license key path and the MuJoCo directory path.
Append the MuJoCo subdirectory bin path into the env variable LD_LIBRARY_PATH.

Install the following libraries:

sudo apt update
sudo apt install libosmesa6-dev libgl1-mesa-glx libglfw3

Install dependencies:

conda env create -f conda_env.yml
conda activate drqv2

Train the agent with original DrQv2:

python train.py task=quadruped_walk

Train the agent with SNAP:

python train.py task=quadruped_walk repeat_type=1 action_repeat=1 update_every_steps=4 nstep=6

Monitor results:

tensorboard --logdir exp_local

License

The majority of this code is licensed under the MIT license, however portions of the project are available under separate license terms: DeepMind is licensed under the Apache 2.0 license.

Name		Name	Last commit message	Last commit date
Latest commit History 55 Commits
drqv2-main		drqv2-main
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

State-Novelty Guided Action Persistence in Deep Reinforcement Learning

Method

Instructions

License

About

Uh oh!

Releases

Packages

Uh oh!

Languages

Jianshu-Hu/action-persistence

Folders and files

Latest commit

History

Repository files navigation

State-Novelty Guided Action Persistence in Deep Reinforcement Learning

Method

Instructions

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages