PyTorch implementations of common deep reinforcement learning algorithms: DQN, PPO, SAC, TD3

BDEvan5/pytorch_drl

Deep reinforcement learning in PyTorch

This repo provides straightforward implementations of common DRL algorithms.

Algorithm list:

  • DQN: Deep Q-network
  • PG: Policy gradient algorithm
  • A2C: Advantage actor critic
  • PPO: Proximal policy optimisation
  • DDPG: Deep deterministic policy gradient
  • TD3: Twin delayed DDPG
  • SAC: Soft actor critic

The document IntroToDRL.pdf provides an introduction to deep reinforcement learning and the important formulas behind the algorithms.
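
As a small illustration of the kind of formula such an introduction covers, the discounted return G_t = r_t + gamma * G_{t+1}, which policy gradient methods such as PG and A2C build on, can be computed as sketched below. This is a minimal sketch and not code taken from this repo; the function name discounted_returns and the default gamma of 0.99 are assumptions made for illustration.

```python
def discounted_returns(rewards, gamma=0.99):
    """Return G_t = r_t + gamma * G_{t+1} for every step t of one episode.

    Hypothetical helper for illustration; not part of this repository.
    """
    returns = [0.0] * len(rewards)
    running = 0.0
    # Walk backwards so each step reuses the return of the step after it.
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        returns[t] = running
    return returns
```

For example, three rewards of 1.0 with gamma=0.5 give returns of [1.75, 1.5, 1.0].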

Training Rewards

[Figures: training reward curves on CartPole (four plots) and Pendulum (three plots)]

Structure

The single_file/ folder contains files with working examples of each algorithm. The modular/ folder contains the same algorithms, split into modular components.
