A Multi-step Minimax Q-learning Algorithm

This repository contains the code for the multi-step minimax Q-learning algorithm proposed for solving two-player zero-sum stochastic games. It can be used to reproduce the results reported in the paper "A Multi-step Minimax Q-learning algorithm for two-player zero-sum Markov games."

Requirements: the nashpy package (available from PyPI, e.g. `pip install nashpy`)
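
For orientation, here is a minimal sketch of the standard one-step minimax Q-learning update, which multi-step variants build on. It is not the algorithm from the paper or the code in this repository. The function names, the nested-list Q-table layout, and the restriction to two actions per player are assumptions made only for illustration. The 2x2 closed-form solver stands in for a general matrix-game solver, which is presumably the role of the nashpy dependency.

```python
def matrix_game_value_2x2(A):
    """Value of a 2x2 zero-sum matrix game for the row (maximizing) player.

    A is [[a, b], [c, d]]: payoffs to the row player; the column player minimizes.
    """
    (a, b), (c, d) = A
    maximin = max(min(a, b), min(c, d))  # best pure-strategy guarantee for the row player
    minimax = min(max(a, c), max(b, d))  # best pure-strategy guarantee for the column player
    if maximin == minimax:
        # A pure-strategy saddle point exists, so it gives the value directly.
        return float(maximin)
    # No saddle point: both players mix, and the value has a closed form.
    return (a * d - b * c) / (a + d - b - c)


def minimax_q_update(Q, s, a, b, r, s_next, alpha=0.1, gamma=0.9):
    """One-step minimax Q-learning update (illustrative, not the paper's multi-step rule).

    Q[s] is a 2x2 payoff table indexed as Q[s][a][b] (hypothetical layout).
    """
    target = r + gamma * matrix_game_value_2x2(Q[s_next])
    Q[s][a][b] = (1 - alpha) * Q[s][a][b] + alpha * target
    return Q[s][a][b]


if __name__ == "__main__":
    # Two states; state 1 holds a matching-pennies table, whose value is 0.
    Q = [
        [[0.0, 0.0], [0.0, 0.0]],
        [[1.0, -1.0], [-1.0, 1.0]],
    ]
    print(minimax_q_update(Q, s=0, a=0, b=1, r=1.0, s_next=1, alpha=0.5))
```

For games with more than two actions per player, the closed form no longer applies and the state value would be obtained from a linear program or a general solver instead.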

Acknowledgments

This code is based on and adapted from Two-Player-SOR. Many thanks to the original authors for their contributions.

Repository contents: SubsectionA, SubsectionB, SubsectionC, README.md

Repository: shreyassr123/Multi-Step-Markov-Games
