Skip to content

Reinforcement learning agents that can beat a human in a game of laser hockey. Team project on the course Reinforcement Learning WS 20/21 @ University of Tuebingen

License

Notifications You must be signed in to change notification settings

anticdimi/laser-hockey

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

40 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Laser Hockey Reinforcement Learning Challenge

This repository contains our team's winning entry for the Laser Hockey challenge as part of the Reinforcement Learning course offered at Eberhard Karls University of Tuebingen (Germany). The agents are trained on the modified Laser Hockey environment, which can be found here and installed as a pip package with: pip install git+https://github.com/antic11d/laser-hockey-env.git

Laser Hockey is a custom environment built using the Open AI gym. The environment is essentially a two player hockey game, in which the agents compete to score a goal against each other. Although seemingly simple, the environment encapsulates a lot of complexities and hardships under the hood.

Laser hockey gameplay

Is reinforcement learning truly needed to find an optimal policy for playing the game? In short, yes. We demonstrated that our trained reinforcement learning agents easily manage to defeat the algorithmic basic opponent provided by the environment.

Moreover, our solution was the winning entry in the tournament between all trained agents from the participants. This tournament consisted of two phases:

  1. A regular phase that included 70+ entries from all course participants
  2. A play-off phase that included only the top 10 teams from the regular session

The leaderboard with the final results from the play-off phase can be found here. Furthermore, the certificate for winning the competition can be found here.

We presented both discrete and continuous action-space solutions for this problem. In particular, these are the algorithms that each of the authors have implemented:

  1. Dueling DQN with Prioritized Experience Replay (Zafir Stojanovski)
  2. Soft Actor-Critic (Dimitrije Antic)
  3. Deep Deterministic Policy Gradient (Jovan Cicvaric)

An extensive report containing detailed algorithm descriptions, ablation/sensitivity studies on the model's hyperparameters, and tricks that played an important role in helping us win the challenge could be found here.

About

Reinforcement learning agents that can beat a human in a game of laser hockey. Team project on the course Reinforcement Learning WS 20/21 @ University of Tuebingen

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages