-
Notifications
You must be signed in to change notification settings - Fork 323
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Advantage Actor Critic (A2C) Model #598
Advantage Actor Critic (A2C) Model #598
Conversation
Hello @blahBlahhhJ! Thanks for updating this PR. There are currently no PEP 8 issues detected in this Pull Request. Cheers! 🍻 Comment last updated at 2021-08-13 12:58:07 UTC |
Codecov Report
@@ Coverage Diff @@
## master #598 +/- ##
===========================================
- Coverage 71.64% 24.32% -47.32%
===========================================
Files 119 120 +1
Lines 7367 7486 +119
===========================================
- Hits 5278 1821 -3457
- Misses 2089 5665 +3576
Flags with carried forward coverage won't be shown. Click here to find out more.
Continue to review full report at Codecov.
|
@akihironitta
Let me know what I should do next for this PR. Thanks! |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@blahBlahhhJ Hi, thank you for your contribution! The implementation looks great! I added a few commits directly to your branch to add its doc and to fix some minor issues. Also, I left some comments below, so would you mind having a look at them?
@blahBlahhhJ Thanks for the update!
|
for more information, see https://pre-commit.ci
…tning-bolts into feature/596_a2c
return batch[0][0][0].device.index if self.on_gpu else "cpu" | ||
|
||
@staticmethod | ||
def add_model_specific_args(arg_parser: ArgumentParser) -> ArgumentParser: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Consider using LightningCLI
see: https://devblog.pytorchlightning.ai/auto-structuring-deep-learning-projects-with-the-lightning-cli-9f40f1ef8b36
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Hi, I looked a bit into it, and It seems like in order to add the arguments (add_model_specific_args()
) in the lightning module, I'll have to write my own subclass of LightningCLI
, which defeats the purpose to use it to simplify the code. Let me know if my understanding is wrong. Or is there an example of using it in other RL algorithms?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
the LightningCLI
takes and maps what ever you have in Module init...
What does this PR do?
Update for #596 (issue)
Implementation of A2C model for Reinforcement Learning
Before submitting
PR review
Anyone in the community is free to review the PR once the tests have passed.
If we didn't discuss your PR in Github issues there's a high chance it will not be merged.
Did you have fun?
Make sure you had fun coding 🙃