Q-Learning Agent for CliffWalking

Project Overview

This project implements a Q-Learning agent that solves the CliffWalking environment from OpenAI Gym. The agent learns to navigate a grid world from the start cell to the goal, avoiding the cliff and finding the shortest path.

Features

Implementation of the Q-Learning algorithm.
Epsilon-greedy strategy for action selection.
Training and testing phases for performance evaluation.
Ability to save and load trained Q-tables.

Requirements

Python 3.x
OpenAI Gym
NumPy

Usage

Run the script: python3 main.py
Follow the prompt to load an existing Q-table or train a new agent.
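
Saving and loading a Q-table can be sketched with NumPy's np.save and np.load. This is a minimal illustration, not the code in main.py: the table shape (48 states, 4 actions, the CliffWalking sizes) and the temporary path are assumptions chosen for the example.

```python
import os
import tempfile

import numpy as np

# Assumed shape: CliffWalking has 4 x 12 = 48 states and 4 actions.
q_table = np.zeros((48, 4))
q_table[36, 0] = 1.0  # arbitrary value so the round trip is visible

# A temporary directory keeps the example from touching real project files.
with tempfile.TemporaryDirectory() as tmp_dir:
    path = os.path.join(tmp_dir, "q_table.npy")
    np.save(path, q_table)    # write the table in NumPy's .npy format
    restored = np.load(path)  # read it back as an ndarray
```

The restored array has the same shape and values as the one that was saved, so a trained agent can be tested later without retraining.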

Q-Learning Agent

The agent is designed to:

Learn optimal policies via Q-Learning.
Use an epsilon-greedy strategy to balance exploration and exploitation.
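
A minimal sketch of epsilon-greedy action selection, assuming the Q-table is a NumPy array indexed as [state, action]. The function name epsilon_greedy, the rng argument, and the table shape are hypothetical and may differ from main.py.

```python
import numpy as np


def epsilon_greedy(q_table, state, epsilon, rng):
    """Pick a random action with probability epsilon, else the greedy one."""
    if rng.random() < epsilon:
        # Explore: any action, chosen uniformly at random.
        return int(rng.integers(q_table.shape[1]))
    # Exploit: the action with the highest estimated value in this state.
    return int(np.argmax(q_table[state]))


rng = np.random.default_rng(0)
q_table = np.zeros((48, 4))  # assumed shape: 48 states, 4 actions
q_table[36, 0] = 1.0         # pretend action 0 looks best in state 36

# With epsilon = 0 the choice is always greedy, so action 0 is selected.
action = epsilon_greedy(q_table, state=36, epsilon=0.0, rng=rng)
```

A larger epsilon makes the agent try more random actions. A common choice is to start high and decay it during training, though whether this project does so is not stated here.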

Training

The agent is trained over a specified number of episodes, learning to maximize rewards in the CliffWalking environment.
The Q-table stores the estimated value of taking each action in each state.
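
The standard tabular Q-Learning update can be sketched as follows. The learning rate alpha and discount factor gamma are illustrative defaults, and the function name q_update is hypothetical. None of these are taken from main.py.

```python
import numpy as np


def q_update(q_table, state, action, reward, next_state, done,
             alpha=0.1, gamma=0.99):
    """Apply one Q-Learning update in place.

    Q(s, a) += alpha * (r + gamma * max_a' Q(s', a') - Q(s, a))
    """
    # No future value is added once the episode has ended.
    best_next = 0.0 if done else float(np.max(q_table[next_state]))
    td_target = reward + gamma * best_next
    q_table[state, action] += alpha * (td_target - q_table[state, action])


q_table = np.zeros((48, 4))  # assumed shape: 48 states, 4 actions

# One step from state 36 to state 24 with reward -1:
# 0 + 0.1 * (-1 + 0.99 * 0 - 0) = -0.1
q_update(q_table, state=36, action=0, reward=-1.0, next_state=24, done=False)
```

Because the target uses the maximum over next actions rather than the action actually taken next, the update is off-policy: the agent can explore with epsilon-greedy while still learning the value of the greedy policy.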

Testing

The agent's performance is evaluated over a number of test episodes.
Rewards per episode are recorded to gauge the effectiveness of the learned policy.
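
The evaluation loop can be sketched as a greedy rollout that records the total reward of each episode. To keep the example runnable without Gym, it uses a simplified stand-in for the environment: the step function, the constants, and the hand-built Q-table below are all assumptions for illustration, not Gym's implementation or the code in main.py.

```python
import numpy as np

ROWS, COLS = 4, 12
START, GOAL = 36, 47  # bottom-left and bottom-right cells
MOVES = {0: (-1, 0), 1: (0, 1), 2: (1, 0), 3: (0, -1)}  # up, right, down, left


def step(state, action):
    """Simplified stand-in environment: returns (next_state, reward, done)."""
    row, col = divmod(state, COLS)
    d_row, d_col = MOVES[action]
    row = min(max(row + d_row, 0), ROWS - 1)  # stay inside the grid
    col = min(max(col + d_col, 0), COLS - 1)
    if row == ROWS - 1 and 0 < col < COLS - 1:
        # Stepped into the cliff: large penalty, back to the start.
        return START, -100.0, False
    next_state = row * COLS + col
    return next_state, -1.0, next_state == GOAL


def evaluate(q_table, episodes=5, max_steps=200):
    """Follow the greedy policy and return the total reward per episode."""
    totals = []
    for _ in range(episodes):
        state, total = START, 0.0
        for _ in range(max_steps):
            action = int(np.argmax(q_table[state]))  # no exploration in testing
            state, reward, done = step(state, action)
            total += reward
            if done:
                break
        totals.append(total)
    return totals


# Hand-built table: up from the start, right along the row above the
# cliff, then down into the goal.
q_table = np.zeros((ROWS * COLS, 4))
q_table[START, 0] = 1.0
q_table[24:35, 1] = 1.0
q_table[35, 2] = 1.0

rewards = evaluate(q_table)  # 13 steps at -1 each, so -13.0 per episode
```

With a real trained table, per-episode totals close to -13 suggest the agent has found the shortest safe path, while much lower totals point to falls into the cliff or wandering.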

kkKaan/q-learning-openai-gym
