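A reinforcement learning algorithm library containing code for the current mainstream reinforcement learning algorithms (value-based and policy-based). All of the code has been debugged and runs.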
Updated Nov 2, 2023 - Python
My reproduction of various reinforcement learning algorithms (DQN variants, A3C, DPPO, RND with PPO) in TensorFlow.
Distributed RL implementation using PyTorch and Ray (Ape-X, A3C, Distributed PPO (DPPO), IMPALA)
A self-written implementation of DPPO (Distributed Proximal Policy Optimization) using TensorFlow
Quantum imaginary time evolution steered by reinforcement learning
ScaleRL is a simple and scalable distributed reinforcement learning framework based on Python and PyTorch