This repository contains the implementation of the paper "Exclusively Penalized Q-learning for Offline Reinforcement Learning", accepted to NeurIPS 2024 as a spotlight paper.
- Create conda env
conda create -n EPQ python=3.7.16
- Activate conda env
conda activate EPQ
- Install pytorch (with GPU)
conda install pytorch==1.12.1 torchvision==0.13.1 torchaudio==0.12.1 cudatoolkit=10.2 -c pytorch
or (with CPU, not recommended)
conda install pytorch==1.12.1 torchvision==0.13.1 torchaudio==0.12.1 cpuonly -c pytorch
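A quick sanity check to confirm the install is working (and, for the GPU build, that CUDA is visible):

```python
import torch

print(torch.__version__)          # expect 1.12.1
print(torch.cuda.is_available())  # True for the GPU build, False for CPU-only
```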
- Install gym
pip install "gym[all]==0.17.2"
- Clone the d4rl repository (https://github.com/rail-berkeley/d4rl)
git clone https://github.com/rail-berkeley/d4rl.git
- Move to the d4rl directory
cd d4rl
- Install d4rl in editable mode
pip install -e .
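Once d4rl is installed, you can verify that datasets load correctly; the snippet below uses d4rl's standard dataset API (the first call downloads the dataset):

```python
import gym
import d4rl  # registers the D4RL environments with gym

env = gym.make('halfcheetah-medium-v2')
dataset = d4rl.qlearning_dataset(env)  # dict of observations, actions, rewards, ...
print(dataset['observations'].shape)
```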
- Install other packages
pip install h5py tqdm pyyaml python-dateutil matplotlib gtimer scikit-learn numba==0.56.2 path.py==12.5.0 patchelf==0.15.0.0 joblib==1.2.0 wandb
This codebase is built on rlkit (https://github.com/vitchyr/rlkit/) and implements CQL (https://github.com/aviralkumar2907/CQL). To run our code, follow the installation instructions above, including the D4RL setup (https://github.com/rail-berkeley/d4rl).
Then we can run EPQ as follows:
- First, train a VAE for the behavior policy (a minimal sketch of such a model appears after these steps):
python behavior_cloning.py --env=halfcheetah-medium-v2
- After the behavior model is trained, run EPQ:
python EPQ_main.py --env=halfcheetah-medium-v2
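For reference, here is a minimal sketch of the kind of conditional VAE commonly used to model the behavior policy in offline RL (as in BCQ/BEAR). The class name, network sizes, and hyperparameters below are illustrative assumptions, not the actual code in behavior_cloning.py:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class BehaviorVAE(nn.Module):
    """Conditional VAE modeling p(a|s): encode (s, a) to a latent z,
    decode (s, z) back to an action. Illustrative sketch only."""
    def __init__(self, state_dim, action_dim, latent_dim=None, max_action=1.0):
        super().__init__()
        self.latent_dim = latent_dim or 2 * action_dim
        self.max_action = max_action
        self.encoder = nn.Sequential(
            nn.Linear(state_dim + action_dim, 256), nn.ReLU(),
            nn.Linear(256, 256), nn.ReLU(),
        )
        self.mean = nn.Linear(256, self.latent_dim)
        self.log_std = nn.Linear(256, self.latent_dim)
        self.decoder = nn.Sequential(
            nn.Linear(state_dim + self.latent_dim, 256), nn.ReLU(),
            nn.Linear(256, 256), nn.ReLU(),
            nn.Linear(256, action_dim),
        )

    def forward(self, state, action):
        h = self.encoder(torch.cat([state, action], dim=-1))
        mean, log_std = self.mean(h), self.log_std(h).clamp(-4, 15)
        z = mean + log_std.exp() * torch.randn_like(mean)  # reparameterization trick
        return self.decode(state, z), mean, log_std

    def decode(self, state, z=None):
        if z is None:  # at evaluation time, sample from a clipped prior
            z = torch.randn(state.shape[0], self.latent_dim,
                            device=state.device).clamp(-0.5, 0.5)
        a = self.decoder(torch.cat([state, z], dim=-1))
        return self.max_action * torch.tanh(a)

def vae_loss(recon, action, mean, log_std, kl_weight=0.5):
    # Reconstruction term plus KL(N(mean, std^2) || N(0, 1))
    recon_loss = F.mse_loss(recon, action)
    kl = -0.5 * (1 + 2 * log_std - mean.pow(2) - (2 * log_std).exp()).mean()
    return recon_loss + kl_weight * kl
```

Sampling actions from the trained decoder gives an estimate of the behavior policy, which the offline RL agent can then use when computing its penalty.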