Skip to content

Official code for "One Risk to Rule Them All: A Risk-Sensitive Perspective on Model-Based Offline Reinforcement Learning", NeurIPS 2023

License

Notifications You must be signed in to change notification settings

marc-rigter/1R2R

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

One Risk to Rule Them All: A Risk-Sensitive Perspective on Model-Based Offline Reinforcement Learning

Official code to reproduce the experiments in the paper "One Risk to Rule Them All: A Risk-Sensitive Perspective on Model-Based Offline Reinforcement Learning ".

Installation

  1. Install MuJoCo 2.1.0 to ~/.mujoco/mujoco210.
  2. Create a conda environment and install 1R2R:
cd 1R2R
conda create --name 1R2R python=3.7
conda activate 1R2R
pip install -e .
pip install -r requirements.txt

Datasets

The datasets introduced for stochastic domains can be found on the HuggingFace Hub as well as Google Drive. By default, the code expects that the datasets are located in the folder 1R2R/datasets.

Usage

Configuration files can be found in examples/config/. For example, to run the stochastic hopper-medium-replay task with high noise, use the following:

1R2R run_example examples.development --config examples.config._1R2R.stochastic_mujoco.hopper_high_noise_medium_replay --seed 0 --gpus 1

If importlib is unable to import the desired config file, this can be resolved by adding to the PYTHONPATH:

export PYTHONPATH="${PYTHONPATH}:/path/to/1R2R"

Logging

By default, TensorBoard logs are generated in the "logs" directory. The code is also set up to log using Weights and Biases (WandB). To enable the use of WandB, set "log_wandb" to True in the configuration file.

Citing 1R2R

@article{rigter2023,
  title={One Risk to Rule Them All: A Risk-Sensitive Perspective on Model-Based Offline Reinforcement Learning},
  author={Rigter, Marc and Lacerda, Bruno and Hawes, Nick},
  journal={Advances in Neural Information Processing Systems},
  year={2023}
}

About

Official code for "One Risk to Rule Them All: A Risk-Sensitive Perspective on Model-Based Offline Reinforcement Learning", NeurIPS 2023

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages