DiverseRL


DiverseRL is a repository that aims to implement and benchmark reinforcement learning algorithms.

This repo aims to implement algorithms from various sub-topics in RL (e.g., model-based RL, offline RL) across a wide variety of environments.

Features

  • Training loop in a single function
    • Each algorithm's training procedure is written in a single function for readers' convenience.
  • Single-file configuration
    • With Hydra, you can manage all of your experiment's settings in a single YAML file (see the sketch after this list).
  • Logging
    • WandB
    • Tensorboard
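
For illustration, a single experiment file might look like the sketch below. This is a hypothetical example: the group names (env, algo, trainer) and keys (device, max_step, log_wandb) are inferred from the run.py commands in Getting Started, not from the repository's actual config schema.

# hypothetical Hydra experiment config; names inferred from the run.py examples below
defaults:
  - env: gym_classic_control
  - algo: dqn
  - trainer: deeprl_trainer
  - _self_

algo:
  device: cuda

trainer:
  max_step: 10000
  log_wandb: true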

Installation


You can install the requirements with Poetry:

git clone https://github.com/moripiri/DiverseRL.git
cd DiverseRL

poetry install
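
Poetry installs the dependencies into its own virtual environment, so prefix commands with poetry run to execute them inside it (this is standard Poetry usage, not specific to this repo):

poetry run python run.py --help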

Algorithms


Currently, the following algorithms are available.

Model-free Deep RL

Model-free Deep RL algorithms train agents directly from state-based (vector) observations, without learning a model of the environment.

Pixel RL

Pixel RL contains algorithms designed for environments with high-dimensional image observations, such as Atari 2600 and dm-control.

Offline RL

Offline RL algorithms learn from a fixed dataset of environment trajectories, without further interaction with the environment.

Classic RL

Classic RL algorithms, mostly known from Sutton and Barto's Reinforcement Learning: An Introduction. These can be trained in Gymnasium's toy-text environments (see the sketch after this list).

  • SARSA
  • Q-learning
  • Model-free Monte-Carlo Control
  • Dyna-Q
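
As a reference for what these tabular methods compute, here is a minimal Q-learning sketch written against plain Gymnasium rather than DiverseRL's API; the environment choice (FrozenLake-v1) and the hyperparameters are illustrative assumptions.

import numpy as np
import gymnasium as gym

# Illustrative toy-text environment and hyperparameters (not DiverseRL defaults).
env = gym.make("FrozenLake-v1")
q = np.zeros((env.observation_space.n, env.action_space.n))
alpha, gamma, epsilon = 0.1, 0.99, 0.1  # step size, discount, exploration rate

for episode in range(5000):
    state, _ = env.reset()
    done = False
    while not done:
        # epsilon-greedy behavior policy
        if np.random.rand() < epsilon:
            action = env.action_space.sample()
        else:
            action = int(np.argmax(q[state]))
        next_state, reward, terminated, truncated, _ = env.step(action)
        # Q-learning update: bootstrap from the greedy next-state value
        target = reward + gamma * np.max(q[next_state]) * (not terminated)
        q[state, action] += alpha * (target - q[state, action])
        state = next_state
        done = terminated or truncated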

Getting Started

Training requires two Gymnasium environments (one for training, one for evaluation), an algorithm, and a trainer.

from diverserl.algos import DQN
from diverserl.trainers import DeepRLTrainer
from diverserl.common.utils import make_envs

# Create the training and evaluation environments.
env, eval_env = make_envs(env_id='CartPole-v1')

# config is a dict of algorithm and trainer hyperparameters,
# e.g. loaded from a Hydra YAML file.
algo = DQN(env=env, eval_env=eval_env, **config)

trainer = DeepRLTrainer(
    algo=algo,
    **config
)

trainer.run()

Or you can use Hydra by running run.py:

python run.py env=gym_classic_control algo=dqn trainer=deeprl_trainer algo.device=cuda trainer.max_step=10000
python run.py --config-name ppo_gym_atari algo.device=cuda trainer.log_wandb=true
python run.py --config-name dqn_gym_atari algo.device=cuda trainer.log_wandb=true
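
Any value in the configuration can also be overridden from the command line, as the algo.device=cuda and trainer.max_step=10000 overrides above show; this is standard Hydra behavior.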
