Only relevant files are described in the following tree structure:
deeprl_cheetah
│ test.py
│ train.py
│
└───results
│ │
│ └───runner # Contains the files of the best training process so far!
│ │ config.yaml # configuration file used
│ │ data.pkl # data collection for plotting
│ │ model.pt # model parameter file
│
└───rl_modules
│ │ actor_critic.py # MLP instances for actor and critic
│ │ storage.py # Storage class (save data, GAE)
│ │ rl_agent.py # Agent class (PPO, play, learn, ..)
│
└───env
│ │ go_env.py
│ └───go
│ │ go1.xml
│ │ go1_torque.xml
│ │ scene.xml # scene that calls go1.xml file
│ │ scene_torque.xml # scene that calls go1_torque.xml file
│ # (used for torque control with config.yaml->use_torque:true)
│
└───networks
│ │ networks.py # MLP Network Definition
│
└───config
│ │ config.yaml
│ │ config_handler.py # Config-Class is used for handling the "config.yaml"-File
│
└───checkpoints
│ │
│ └───latest # Contains the files of the latest training process!
│ │ config.yaml # latest configuration file used
│ │ data.pkl # latest data collection for plotting
│ │ model.pt # latest model parameter file
│ │
│ └───2024-01-14 # other training session
│ │
│ └───2024-01-15
│ .
│ .
│ .
For model training, set the desired properties in config/config.yaml. This file contains settings regarding the environment, the reinforcement learning algorithm, the networks, and the reward function. It is copied to the checkpoint location checkpoints/YYYY-MM-DD/HH-MM-SS/model/.
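A minimal sketch of reading the configuration directly (in the repository this is handled by the Config class in config/config_handler.py; the yaml.safe_load call and the default value below are only an illustration, and use_torque is the only key confirmed by this README):

```python
# Sketch: read config/config.yaml directly; the repository uses the Config
# class from config/config_handler.py for this instead.
import yaml

with open("config/config.yaml", "r") as f:
    cfg = yaml.safe_load(f)

# use_torque is referenced in the tree above (selects scene_torque.xml /
# go1_torque.xml); the default value here is an assumption.
use_torque = cfg.get("use_torque", False)
```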
During training, the model parameters and relevant training data are saved under checkpoints/.../model/. This folder is also copied to checkpoints/latest and therefore represents the latest training data.
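A rough sketch of how the dated checkpoint folder and the latest mirror could be produced; only the directory layout comes from the tree above, the copy logic and calls are assumptions:

```python
# Sketch: create a dated checkpoint folder and mirror it to checkpoints/latest.
# The directory layout follows the tree above; the copy logic is assumed.
import shutil
from datetime import datetime
from pathlib import Path

run_dir = Path("checkpoints") / datetime.now().strftime("%Y-%m-%d/%H-%M-%S") / "model"
run_dir.mkdir(parents=True, exist_ok=True)

shutil.copy("config/config.yaml", run_dir / "config.yaml")  # keep the used config
# ... training writes model.pt and data.pkl into run_dir ...

latest = Path("checkpoints/latest")
shutil.rmtree(latest, ignore_errors=True)   # replace the previous mirror
shutil.copytree(run_dir, latest)            # mirror the newest run
```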
The test file can be used with a specific experiment (use_experiment=True). In that case the experiment folder has to be located at results/experiment_name/ and the variable experiment=experiment_name has to be set. It is also possible to select a specific training run (use_experiment=False), as implemented in the original test.py file.
Note: The "runner" experiment is the best result so far; it is executed if you run the "test.py" file without changes!
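A sketch of the experiment-selection logic described above; only the flags use_experiment and experiment and the folder names come from this README, the path handling is an assumption:

```python
# Sketch of how test.py might resolve the folder to load from.
# use_experiment / experiment are described above; everything else is assumed.
from pathlib import Path

use_experiment = True     # False: load a training run from checkpoints/ instead
experiment = "runner"     # experiment folder under results/ ("runner" = best run)

if use_experiment:
    run_dir = Path("results") / experiment
else:
    run_dir = Path("checkpoints/latest")

config_path = run_dir / "config.yaml"   # files expected in the run folder
model_path = run_dir / "model.pt"
```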
The config file "config/config.yaml" is copied into the checkpoint folder checkpoints/.../model. In addition, the relevant data values recorded in rl_agent.learn() are saved as a dictionary in the same folder, together with a copy of the newest model.pt of the current training process.
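A minimal sketch of how the recorded values and the newest parameters could end up as data.pkl and model.pt; the dictionary keys and the stand-in network are assumptions:

```python
# Sketch: persist training data (data.pkl) and model parameters (model.pt)
# into the checkpoint folder; keys and the stand-in network are assumptions.
import pickle
from pathlib import Path

import torch
import torch.nn as nn

run_dir = Path("checkpoints/latest")
run_dir.mkdir(parents=True, exist_ok=True)

data = {                                  # hypothetical values collected in rl_agent.learn()
    "mean_reward": [0.1, 0.4, 0.9],
    "episode_length": [120, 250, 400],
}
with open(run_dir / "data.pkl", "wb") as f:
    pickle.dump(data, f)

model = nn.Linear(4, 2)                   # stand-in for the actor-critic network
torch.save(model.state_dict(), run_dir / "model.pt")
```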
- check implementations
- calculate feet position
- add experience replay
- ETH:
- Paper: https://arxiv.org/pdf/2109.11978.pdf
- Rewards: https://github.com/leggedrobotics/legged_gym/blob/20f7f92e89865084b958895d988513108b307e6c/legged_gym/envs/base/legged_robot.py#L853
- PPO: https://github.com/leggedrobotics/rsl_rl/blob/master/rsl_rl/algorithms/ppo.py
- GAE: https://github.com/leggedrobotics/rsl_rl/blob/master/rsl_rl/storage/rollout_storage.py (see the GAE sketch after this list)
- Nature Paper: https://www.nature.com/articles/s41598-023-38259-7
- use LSTM as hidden layer for the network:
- PPO: https://spinningup.openai.com/en/latest/algorithms/ppo.html
- Reward functions:
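For the GAE reference above, a minimal, self-contained sketch of Generalized Advantage Estimation; the function name, tensor shapes, and the gamma/lambda defaults are assumptions and are not taken from the linked rollout_storage.py:

```python
# Sketch of Generalized Advantage Estimation (GAE) over a rollout of length T.
# Names, shapes, and gamma/lambda defaults are assumptions for illustration.
import torch

def compute_gae(rewards, values, dones, last_value, gamma=0.99, lam=0.95):
    """rewards, values, dones: tensors of shape (T,); last_value: scalar tensor."""
    T = rewards.shape[0]
    advantages = torch.zeros_like(rewards)
    gae = 0.0
    for t in reversed(range(T)):
        next_value = last_value if t == T - 1 else values[t + 1]
        not_done = 1.0 - dones[t]
        # TD residual, then exponentially weighted sum of residuals.
        delta = rewards[t] + gamma * next_value * not_done - values[t]
        gae = delta + gamma * lam * not_done * gae
        advantages[t] = gae
    returns = advantages + values          # targets for the value function
    return advantages, returns
```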