[RLlib] Add additional_update to RL Trainer #31541
Conversation
Signed-off-by: Avnish <avnishnarayan@gmail.com>
Quick comment.
rllib/core/rl_trainer/rl_trainer.py
Outdated
@@ -205,6 +205,14 @@ def update(self, batch: MultiAgentBatch) -> Mapping[str, Any]:
        self.do_distributed_update(batch)
        return self.compile_results(batch, fwd_out, loss, post_processed_gradients)

    def additional_update(self) -> Mapping[str, Any]:
Should it not take in any parameters, like *args, **kwargs? What should be returned? Also, could you provide some more context on where this would get called?
Fair enough. I thought there wasn't a point to adding args/kwargs here because users can just do whatever they want in their override.
Then it should be *args, **kwargs to make it explicit. But the returned value should have some explanation.
Done.
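For reference, the shape suggested in the review above might look like the following. This is a minimal sketch, not the exact code merged in this PR; the docstring wording and call site are assumptions based on the discussion.

```python
from typing import Any, Mapping


class RLTrainer:
    """Minimal stand-in for RLlib's RL Trainer base class (sketch only)."""

    def additional_update(self, *args, **kwargs) -> Mapping[str, Any]:
        """Apply logic that is not covered by the gradient-based update.

        Intended to be called by the training loop between (or after)
        gradient updates, e.g. to sync or interpolate target-network
        weights. Returns a dict of metrics produced by the update.
        """
        raise NotImplementedError
```

Making the signature `*args, **kwargs` keeps the hook generic while signaling to subclass authors that the training loop may forward arbitrary arguments through to it.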
Add the additional_update function to the RL Trainer.
For example, it could be used to perform a Polyak averaging update
of a target network in off-policy algorithms like SAC or DQN.
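As a rough illustration of the target-network use case described above (not code from this PR; the subclass and attribute names are hypothetical, and real weights would be tensors rather than floats), a Polyak-averaging override could look like:

```python
from typing import Any, Dict, Mapping


class RLTrainer:  # minimal stub standing in for RLlib's RL Trainer
    def additional_update(self, *args, **kwargs) -> Mapping[str, Any]:
        raise NotImplementedError


class PolyakTrainer(RLTrainer):  # hypothetical off-policy trainer
    def __init__(self, online: Dict[str, float], target: Dict[str, float]):
        self.online_weights = online
        self.target_weights = target

    def additional_update(self, *args, tau: float = 0.005, **kwargs) -> Mapping[str, Any]:
        # Polyak averaging: target <- tau * online + (1 - tau) * target
        self.target_weights = {
            k: tau * self.online_weights[k] + (1.0 - tau) * self.target_weights[k]
            for k in self.target_weights
        }
        # Return metrics so the training loop can log them.
        return {"target_update_tau": tau}
```

The training loop would invoke `additional_update` after some number of gradient updates, so the target network trails the online network smoothly instead of being copied all at once.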
Why are these changes needed?
Related issue number
Checks
- I've signed off every commit (by using the -s flag, i.e., git commit -s) in this PR.
- I've run scripts/format.sh to lint the changes in this PR.