mlv_2020_project

A quick dive into comparing the verifiable safety of safe reinforcement learning vs state-of-the-art deep reinforcement learning. This work focuses on comparing the DDPG implementations from End-to-End Safe Reinforcement Learning through Barrier Functions for Safety-Critical Continuous Control Functions and their Github implementation at https://github.com/rcheng805/RL-CBF.

Installation and Reproducability

To install and reproduce the results presented in the report, you will need Anaconda Python installed (make sure you have the Python3.7 version) and access to a bash-enabled terminal.

Once you have Anaconda installed, navigate in your terminal to this directory and run

./setup.bash

This will create an Anaconda environment for running all of our scripts in. The environment is as close as we could get to the original work's, however, some warnings may appear while running the scripts, but they will run.

Now your system is setup and can run the reproduce code:

./reproduce_models.bash

Feel free to make modifications within this bash script to try different random seeds. Note: this will overwrite the models currently available in the repo.

To verify the models, make sure you have the NNV tool installed and setup on your system. Then, add the verification folder to your MATLAB working directory and run verify_models.m. That will run all of the verification tests described in the report.

If there are any issues using or running the files in this repo, please contact me at [email protected] and I'll try to help you as best I can.

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
DDPG-CBF		DDPG-CBF
DDPG		DDPG
figures		figures
verification		verification
README.md		README.md
generate_tables.py		generate_tables.py
plot_results.py		plot_results.py
reproduce_models.bash		reproduce_models.bash
setup.bash		setup.bash

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

mlv_2020_project

Installation and Reproducability

About

Releases

Packages

Languages

nphamilton/mlv_2020_project

Folders and files

Latest commit

History

Repository files navigation

mlv_2020_project

Installation and Reproducability

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages