DeepRL

Framework for deep reinforcement learning.

Features:

Algorithms are splited into modules
Easy to run algorithms asynchronously
Easy to add new algorithms

Dependences

python3.6
numpy
pytorch
gym

Install

git clone https://github.com/ppaanngggg/DeepRL
pip install -e .

Modules:

1. Agent

DoubleDQNAgent: Basic deep Q learning with double Q learning

Human-level control through deep reinforcement learning

Deep Reinforcement Learning with Double Q-learning
DDPGAgent: continue control by deep deterministic policy gradient

CONTINUOUS CONTROL WITH DEEP REINFORCEMENT LEARNING
PPOAgent: continue control by proximal policy optimization

Proximal Policy Optimization Algorithms

2. Replay

Replay: Basic replay, randomly choose from pool and remove the oldest one

Human-level control through deep reinforcement learning
ReservoirReplay: randomly choose from pool and randomly remove one, used in NFSPAgent's policy network

Deep Reinforcement Learning from Self-Play in Imperfect-Information Games
TmpReplay: just for module, no replay at all

3. Train

Train: normal trainer
TrainEpoch:
AsynTrainEpoch: it will

4. Env

EnvAbstract: Env interface, similar to gym's interfaces. User has to reimplement interface functions

TODO

turn python2 to python3.6
turn tensorflow to pytorch
add more agent
well doc

Name		Name	Last commit message	Last commit date
Latest commit History 134 Commits
DeepRL		DeepRL
samples		samples
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DeepRL

Features:

Dependences

Install

Modules:

1. Agent

2. Replay

3. Train

4. Env

TODO

About

Releases

Packages

Contributors 2

Languages

License

ppaanngggg/DeepRL

Folders and files

Latest commit

History

Repository files navigation

DeepRL

Features:

Dependences

Install

Modules:

1. Agent

2. Replay

3. Train

4. Env

TODO

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages