GitHub - maxreciprocate/offline: Offline RL experiments

Evaluating on Graph Shortest Path task from Decision Transformer (Lili Chen et al. 2021):

where for each random graph, a transformer is trained to find optimal trajectories using only 1000 random walks.

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
scripts		scripts
captions.py		captions.py
carps.py		carps.py
config.yaml		config.yaml
deepspeed.yaml		deepspeed.yaml
ilql.py		ilql.py
models.py		models.py
randomwalks.py		randomwalks.py
readme.md		readme.md
requirements.txt		requirements.txt
sentiments.py		sentiments.py
utils.py		utils.py

Provide feedback