This repo contains implementations of algorithms such a Q-learning, SARSA, TD, Policy gradient
q-learning pytorch dqn epsilon-greedy breakout sarsa policy-iteration value-iteration monte-carlo-methods deep-q-learning model-based-rl model-free-rl td-methods model-free-control
-
Updated
Dec 8, 2019 - Python