Skip to content
/ ARS Public
forked from modestyachts/ARS

An implementation of the Augmented Random Search algorithm for The DeepMind Control Suite and Package

License

Notifications You must be signed in to change notification settings

pedronahum/ARS

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

65 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

This fork is an extension of ARS for The DeepMind Control Suite and Package.

Implemented policies:

  • linear: no changes to the linear policy from ARS.
  • snp: Policy that increases the input dimension with a max operator SNP
  • mlp: mlp policy with layer normalization.

Work-in-progress:

  • lenn: policy that increases the input dimension with Legendre polynomials
  • mlp-max: taking the ideas from snp, a mlp policy that increases the input dimension with a max operator
  • polynomial: A polynomial policy with input normalization.
  • linear-ensemble: linear policies are combined through a weighted sum (a la bagging)
  • linear-residual-policy: A "leader" policy plus additional helper policies (work in progress...)

Augmented Random Search (ARS)

ARS is a random search method for training linear policies for continuous control problems, based on the paper "Simple random search provides a competitive approach to reinforcement learning."

Prerequisites for running ARS

Our ARS implementation relies on Python 3, OpenAI Gym, DM Control, and the Ray library for parallel computing.

To install DM Control and MuJoCo dependencies follow the instructions here: https://github.com/deepmind/dm_control

To install Ray execute:

pip install ray

For more information on Ray see http://ray.readthedocs.io/en/latest/.

Running ARS

We recommend using single threaded linear algebra computations by setting:

export MKL_NUM_THREADS=1

To train a policy for the "walker" domain with a "walk" task, execute the following command:

python code/ars.py

Rendering Trained Policy

To render a trained policy, execute a command of the following form:

python code/run_ars_policy.py

Please note that movie-py is needed to build the gif

About

An implementation of the Augmented Random Search algorithm for The DeepMind Control Suite and Package

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • HTML 53.6%
  • Jupyter Notebook 42.8%
  • Python 3.6%