Clean single-file implementation of offline RL algorithms in JAX
reinforcement-learning
flax
cql
single-file
jax
awac
iql
offline-rl
offline-reinforcement-learning
d4rl
decision-transformer
td3bc
Updated Aug 15, 2024 - Python