This project is to test the perfomance of ARS on breakout-ram-V4

Augmented Random Search (ARS)

ARS is a random search method for training linear policies for continuous control problems, based on the paper "Simple random search provides a competitive approach to reinforcement learning."

Prerequisites for running ARS

Our ARS implementation relies on Python 3, OpenAI Gym version 0.9.3, mujoco-py 0.5.7, MuJoCo Pro version 1.31, and the Ray library for parallel computing.

To install OpenAI Gym and MuJoCo dependencies follow the instructions here: https://github.com/openai/gym

To install Ray execute:

pip install ray

For more information on Ray see http://ray.readthedocs.io/en/latest/.

Running ARS

First start Ray by executing a command of the following form:

ray start --head --redis-port=6379 --num-cpus=16

This command starts multiple Python processes on one machine for parallel computations with Ray. Set "num_cous=X" for parallelizing ARS across X CPUs. For parallelzing ARS on a cluster follow the instructions here: http://ray.readthedocs.io/en/latest/using-ray-on-a-large-cluster.html.

We recommend using single threaded linear algebra computations by setting:

export MKL_NUM_THREADS=1

To train a policy for breakout, execute the following command:

python code/ars.py

Name		Name	Last commit message	Last commit date
Latest commit History 61 Commits
code		code
code_multi		code_multi
LICENSE		LICENSE
README.md		README.md
run.slurm		run.slurm

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

This project is to test the perfomance of ARS on breakout-ram-V4

Augmented Random Search (ARS)

Prerequisites for running ARS

Running ARS

About

Releases

Packages

Languages

License

junjzhang/ARS

Folders and files

Latest commit

History

Repository files navigation

This project is to test the perfomance of ARS on breakout-ram-V4

Augmented Random Search (ARS)

Prerequisites for running ARS

Running ARS

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages