Code for "Continual Diffuser (CoD): Mastering Continual Offline Reinforcement Learning with Experience Rehearsal".
- We construct a continual offline RL benchmark that currently contains 90 tasks, and we will actively incorporate more datasets for all researchers.
- We investigate the possibility of integrating experience rehearsal with the diffuser and propose the Continual Diffuser (CoD) to balance plasticity and stability (a generic sketch of experience rehearsal is given below).
- Extensive experiments on a series of tasks show that CoD achieves a promising plasticity-stability trade-off and outperforms existing baselines on most tasks.
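For readers unfamiliar with experience rehearsal, here is a minimal, generic sketch of the idea: keep a small buffer of samples from earlier tasks and mix them into each training batch for the current task. The names (RehearsalBuffer, make_training_batch) and parameters (capacity_per_task, rehearsal_ratio) are illustrative assumptions, not the repository's actual implementation; how the rehearsed data enters the diffuser's training objective is specific to CoD and described in the paper.

```python
import random

class RehearsalBuffer:
    """Stores a fixed-size subset of each previously seen task's offline data."""

    def __init__(self, capacity_per_task=1000):
        self.capacity_per_task = capacity_per_task
        self.buffers = {}  # task_id -> list of stored samples

    def add_task(self, task_id, samples):
        # Keep only a bounded subset of the finished task's dataset.
        keep = min(len(samples), self.capacity_per_task)
        self.buffers[task_id] = random.sample(samples, keep)

    def sample(self, batch_size):
        # Draw a rehearsal batch uniformly from all previously stored tasks.
        pool = [s for buf in self.buffers.values() for s in buf]
        if not pool:
            return []
        return random.choices(pool, k=batch_size)

def make_training_batch(current_task_samples, rehearsal_buffer,
                        batch_size=64, rehearsal_ratio=0.25):
    """Mix current-task data with rehearsal data from earlier tasks."""
    n_rehearsal = int(batch_size * rehearsal_ratio) if rehearsal_buffer.buffers else 0
    n_current = batch_size - n_rehearsal
    batch = random.choices(current_task_samples, k=n_current)
    batch += rehearsal_buffer.sample(n_rehearsal)
    random.shuffle(batch)
    return batch
```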
First, clone the code and install the required packages:
pip install -r requirements.txt
Before you start, we strongly recommend registering a wandb account, which will record the graphs and curves produced during the experiments. To log in from your shell, enter the following command and follow the prompts:
wandb login
Your API key can be found on the User Settings page (https://wandb.ai/settings). For more information, see https://docs.wandb.ai/quickstart .
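For reference, below is a minimal sketch of the standard wandb logging pattern. The project name, run name, config entries, and metric keys are placeholders; continual_diffuser_train.py may wire this up differently.

```python
import wandb

# Placeholder project/run names and config values; the actual ones used by
# continual_diffuser_train.py may differ.
wandb.init(project="continual-diffuser", name="cod-demo-run",
           config={"lr": 2e-4, "batch_size": 64})

for step in range(1000):
    loss = 1.0 / (step + 1)  # stand-in for the real diffusion training loss
    wandb.log({"train/loss": loss, "step": step})

wandb.finish()
```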
Next, here is how to replicate the experiments.
To use the default training config, run:
python continual_diffuser_train.py
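If you prefer not to sync runs to the wandb servers (for example, on a machine without internet access), wandb's standard offline mode can be enabled through an environment variable before launching the script:
WANDB_MODE=offline python continual_diffuser_train.py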
Instructions on how to set the various hyperparameters will be added soon.
We will release the offline datasets soon, and the benchmark will be updated from time to time. Please follow our updated instructions.