Name		Name	Last commit message	Last commit date
parent directory ..
Beschleunigen.png		Beschleunigen.png
Direction_endcoding.png		Direction_endcoding.png
Directions_Legend.png		Directions_Legend.png
Directions_Legend.svg		Directions_Legend.svg
FullCourse_MonteCarlo_Solved.png		FullCourse_MonteCarlo_Solved.png
Gym_RL.svg		Gym_RL.svg
Monte-Carlo_Methods.ipynb		Monte-Carlo_Methods.ipynb
RL_GYM_racetrack.png		RL_GYM_racetrack.png
Racetrack1.png		Racetrack1.png
break.png		break.png
racetrack_environment.py		racetrack_environment.py
readme.md		readme.md
solution_2.gif		solution_2.gif
solution_2_2.gif		solution_2_2.gif

readme.md

Exercise 04

In this exercise we will use the included racetrack_environment in order to write our first reinforcement learning algorithm. The used algorithm is Monte-Carlo learning in an on- and off-policy fashion.

Tasks:

policy evaluation using first-visit Monte-Carlo
on-policy epsilon-greedy control using first-visit Monte-Carlo
off-policy epsilon-greedy control with weighted importance sampling Monte-Carlo
extra challenge

(Source: https://media.giphy.com/media/UqZ4imFIoljlr5O2sM/giphy.gif)