Skip to content

Latest commit

 

History

History
13 lines (11 loc) · 256 Bytes

File metadata and controls

13 lines (11 loc) · 256 Bytes

Reinforcmenet Learning Algorithms Practice Repository

Algorithm list

  1. PPO (proximal policy optimization)

Environment list

  1. cart pole
  2. OpenAI MPE

Pip list

  1. pytorch==1.10.1
  2. gym==0.21.0
  3. pettingzoo==1.14.0
  4. supersuit==3.3.2