Framework for deep reinforcement learning.
- Algorithms are splited into modules
- Easy to run algorithms asynchronously
- Easy to add new algorithms
- python3.6
- numpy
- pytorch
- gym
git clone https://github.com/ppaanngggg/DeepRL
pip install -e .
-
DoubleDQNAgent: Basic deep Q learning with double Q learning
Human-level control through deep reinforcement learning
Deep Reinforcement Learning with Double Q-learning
-
DDPGAgent: continue control by deep deterministic policy gradient
CONTINUOUS CONTROL WITH DEEP REINFORCEMENT LEARNING
-
PPOAgent: continue control by proximal policy optimization
Proximal Policy Optimization Algorithms
-
Replay: Basic replay, randomly choose from pool and remove the oldest one
Human-level control through deep reinforcement learning
-
ReservoirReplay: randomly choose from pool and randomly remove one, used in NFSPAgent's policy network
Deep Reinforcement Learning from Self-Play in Imperfect-Information Games
-
TmpReplay: just for module, no replay at all
- Train: normal trainer
- TrainEpoch:
- AsynTrainEpoch: it will
- EnvAbstract: Env interface, similar to gym's interfaces. User has to reimplement interface functions
- turn python2 to python3.6
- turn tensorflow to pytorch
- add more agent
- well doc