PPO

PPO implementation for OpenAI gym environment based on Unity ML Agents: https://github.com/Unity-Technologies/ml-agents

Notable changes include:

Ability to continuously display progress with non-stochastic policy during training
Works with OpenAI environments
Option to record episodes
State normalization for given number of frames
Frame skip
Faster reward discounting etc.

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
.idea		.idea
__pycache__		__pycache__
agents		agents
models		models
ppo		ppo
working model		working model
README.md		README.md
best-practices-ppo.md		best-practices-ppo.md
graphexporter.py		graphexporter.py
ppo.py		ppo.py
test.py		test.py

Provide feedback