PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL + D2RL and parallel Environments.
-
Updated
Feb 24, 2021 - Python
PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL + D2RL and parallel Environments.
Pytorch implementation of large network design in continous control RL.
PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2RL which can be added to D4PG to improve its performance.
Durham University, Dissertation: 1st - 92. Additional Materials and Codebase for the paper: Combining Recent Advances in Reinforcement Learning for Super Mario Bros. - Recurrent Replay Deeper Denser Distributed DQN+ (R2D4+).
PyTorch implementation of Twin Delayed Deep Deterministic Policy Gradient (TD3) - including additional Extension to improve the algorithm's performance.
Add a description, image, and links to the d2rl topic page so that developers can more easily learn about it.
To associate your repository with the d2rl topic, visit your repo's landing page and select "manage topics."