[ICML 2024] Author's Implementation of RVI-SAC
-
Updated
Aug 12, 2024 - Python
[ICML 2024] Author's Implementation of RVI-SAC
Code for the numerical experiments in Zhang, Sheng, Zhe Zhang, and Siva Theja Maguluri. "Finite Sample Analysis of Average-Reward TD Learning and Q-Learning."
Add a description, image, and links to the average-reward topic page so that developers can more easily learn about it.
To associate your repository with the average-reward topic, visit your repo's landing page and select "manage topics."