- Q-learning (Cart-Pole env.)
- Double Q-learning (Cart-Pole env.)
- Sarsa (Frozen-Lake env.)
- Sarsa(λ) (Frozen-Lake env.)
- Deep Q-Network (DQN) (Cart-Pole env.)
- REINFORCE (Cart-Pole env.)
- Advantage Actor-Critic (A2C) (Cart-Pole env.)
Online Courses:
Textbooks:
Other Helpful materials/articles:
Yeskendir Koishekenov @YeskendirK