Skip to content

JF-Hu/Continual_Diffuser

Repository files navigation

Continual Diffuser (CoD)

Code for "Continual Diffuser (CoD): Mastering Continual Offline Reinforcement Learning with Experience Rehearsal".

Framework

  • We construct a continual offline RL benchmark that contains 90 tasks in the current stage, and we will actively incorporate more datasets for all researchers.

  • We investigate the possibility of integrating experience rehearsal and diffuser, then propose the Continual Diffuser (CoD) to balance plasticity and stability.

  • Extensive experiments on a series of tasks show that CoD can achieve a promising plasticity-stability trade-off and outperform existing baselines on most tasks.

First clone the code and installation of the relevant package.

pip install -r requirements.txt

Before you start we strongly recommend that you register a wandb account. This will record graphs and curves during the experiment. If you want, complete the login operation in your shell. Enter the following command and follow the prompts to complete the login.

wandb login

API keys can be found in User Settings page https://wandb.ai/settings. For more information you can refer to https://docs.wandb.ai/quickstart .

Next is how to replicate all experiments:

For Model Training

If use default training config:

python continual_diffuser_train.py 

The instruction of how to set the various hyperparameters will be updated soon.

For offline datasets

We will release the offline datasets soon, and the benchmark will be updated aperiodically. Please follow our updated instructions.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages