Implement of serval reinforcement learning algorithm
- algorithms: implement of game theory algorithm
- docs: document of each algorithm and environment
- envs: game environments
- utils: common tools
Algorithm | Reference | Status |
---|---|---|
DQN | 10.1038/nature14236 | ✅ |
SARSA | ✅ | |
Double DQN | 10.1609/aaai.v30i1.10295 | ✅ |
Dueling DQN | 10.48550/arXiv.1511.06581 | ✅ |
Rainbow DQN | 10.1609/aaai.v32i1.11796 | |
REINFORCE | 10.1007/BF00992696 | ✅ |
VPG(Vanilla Policy Gradient) | ✅ | |
AC(Actor Critic) | ~ | |
A2C | ||
A3C | ||
DPG(Deterministic Policy Gradient) | ||
DDPG | ||
TRPO(Trust Region Policy Optimization) | 10.5555/3045118.3045319 | |
PPO(Proximal Policy Optimization) | arXiv:1707.06347 | ✅ |
TD3 |
- thu-ml/tianshou: An elegant PyTorch deep reinforcement learning library. (github.com)
- deepmind/open_spiel: OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games. (github.com)
- PyTorch documentation — PyTorch 1.13 documentation
- Welcome to Spinning Up in Deep RL! — Spinning Up documentation (openai.com)