Skip to content

Latest commit

 

History

History

BipedalWalker-v2

BipedalWalker Problem

Getting Started

The environment to the BipedalWalker is described here.

Solution Video

BipedalWalker-v2

The video shows in the first part the behaviour of the untrained agent and then in comparison the behaviour of the trained agent.

Solution Info

My learning algorithm is a Twin Delayed Deep Deterministic Policy Gradient algorithm (TD3).

Instructions

start Jupyter Notebook BipedalWalker-v2.ipynb and follow the instructions.