stable-baselines with JAX & Haiku
reinforcement-learning imitation-learning diffusion haiku dataset-aggregation proximal-policy-optimization behavior-cloning jax soft-actor-critic dm-haiku decision-transformers
-
Updated
Jun 20, 2024 - Python