[RLlib] RLTrainer stand-alone unittests #31552

kouroshHakha · 2023-01-10T01:46:22Z

Why are these changes needed?

This PR implements stand-alone unit tests for RLTrainer. For now it covers only the TensorFlow implementation; Torch coverage will be added later.
This is a follow-up to #31511.

Related issue number

Checks

- I've signed off every commit (by using the -s flag, i.e., git commit -s) in this PR.
- I've run scripts/format.sh to lint the changes in this PR.
- I've included any doc changes needed for https://docs.ray.io/en/master/.
- I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
- Testing Strategy
  - Unit tests
  - Release tests
  - This PR is not tested :(

kouroshHakha · 2023-01-10T01:49:32Z

rllib/core/rl_trainer/tests/tf/test_tf_rl_trainer.py

@@ -23,6 +25,7 @@ def setUp(cls) -> None:
    def tearDown(cls) -> None:
        ray.shutdown()

+    @pytest.mark.skip


We'll reactivate this test as a trainer_runner test once we iterate through the interface of trainer_runner and algorithm to solidify its design.
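
A small sketch of how the skip could carry its reason, so the intent above is visible in test output. It uses the standard library's `unittest.skip` rather than the `pytest.mark.skip` shown in the diff (`pytest.mark.skip` also accepts a `reason=` keyword). The class name, method names, and reason string are hypothetical, not taken from the PR.

```python
import unittest


class TestSkippedExample(unittest.TestCase):
    """Hypothetical test case; names are illustrative, not from the PR."""

    @unittest.skip("pending trainer_runner interface redesign")
    def test_update_multigpu(self):
        # Never executed while the decorator is present.
        self.fail("should not be executed")

    def test_still_runs(self):
        # An unrelated test in the same class keeps running.
        self.assertEqual(1 + 1, 2)


def run_suite() -> unittest.TestResult:
    """Run the example test case and return the collected result."""
    suite = unittest.defaultTestLoader.loadTestsFromTestCase(TestSkippedExample)
    result = unittest.TestResult()
    suite.run(result)
    return result
```

With a reason attached, the runner reports why the test was skipped, which makes it easier to find and re-enable later.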

avnishn · 2023-01-10T18:46:59Z

rllib/core/rl_trainer/tests/test_rl_trainer.py

+
+        with tf.GradientTape() as tape:
+            params = trainer.module[DEFAULT_POLICY_ID].trainable_variables
+            loss = {"total_loss": sum([tf.reduce_sum(param) for param in params])}


nice property that you've used here.
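
The comment does not say which property is meant. One plausible reading, stated here as an assumption, is that a loss equal to the sum of all parameter entries has a gradient of exactly 1 for every entry, so the result of an optimizer step is known in closed form. The sketch below checks that with plain Python lists and finite differences instead of `tf.GradientTape`; all function names are hypothetical.

```python
from typing import List

Params = List[List[float]]


def total_loss(params: Params) -> float:
    """Sum of every entry of every parameter, like sum(reduce_sum(p))."""
    return sum(sum(p) for p in params)


def numerical_grad(params: Params, eps: float = 1e-6) -> Params:
    """Central finite-difference gradient of total_loss per entry."""
    grads = []
    for p in params:
        g = []
        for j in range(len(p)):
            original = p[j]
            p[j] = original + eps
            up = total_loss(params)
            p[j] = original - eps
            down = total_loss(params)
            p[j] = original  # restore the entry before moving on
            g.append((up - down) / (2 * eps))
        grads.append(g)
    return grads


def sgd_step(params: Params, grads: Params, lr: float) -> Params:
    """Plain gradient descent: w <- w - lr * g."""
    return [[w - lr * g for w, g in zip(p, gp)] for p, gp in zip(params, grads)]
```

Under this reading, a single plain SGD step lowers every entry by the learning rate, which gives a test an exact expected value to compare against.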

avnishn · 2023-01-10T18:54:18Z

rllib/utils/test_utils.py

@@ -1146,7 +1146,7 @@ def get_cartpole_dataset_reader(batch_size: int = 1) -> "DatasetReader":
        get_dataset_and_shards,
    )

-    path = "tests/data/cartpole/large.json"
+    path = "rllib/tests/data/cartpole/large.json"


I think you need to remove this prepend

yep yep. just pushed a patch.
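
For context, whether the `rllib/` prefix is needed depends on which directory the relative path is resolved against. A minimal sketch of making that base explicit follows; `resolve_data_path` and its `root` argument are hypothetical and not part of `rllib/utils/test_utils.py`.

```python
from pathlib import Path


def resolve_data_path(relative: str, root: Path) -> Path:
    """Return root/relative as an absolute path, failing loudly if missing.

    `root` is whatever directory the caller treats as the base (assumed
    here to be a package directory); it is not inferred from the cwd.
    """
    candidate = (root / relative).resolve()
    if not candidate.is_file():
        raise FileNotFoundError(f"{relative!r} not found under {root}")
    return candidate
```

With an explicit base, a wrong prefix surfaces as an immediate `FileNotFoundError` naming both the path and the base, rather than as a failure that depends on where the test was launched.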

kouroshHakha added 28 commits January 5, 2023 19:05

moved rl_optimizer logic into rl_trainer

d8571fe

Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>

wip

d7a6a24

Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>

wip

888b226

Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>

wip

e518e15

Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>

1. added in_test to RLTrainer to allow doing test-specific stuff
2. multi-gpus tests pass now

f5416b1

Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>

moved the dataset reader logic into a test_util method

6286208

Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>

all multi-gpu unittests are now passing

4583da1

Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>

updated docstrings

05dfe6c

Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>

docstrings

705c6ea

Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>

wip

6ba746c

Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>

removed optimizers from the ci test suite

2625430

Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>

revert rllib prefix

131d4c9

Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>

skipping rl_optimizer tests

7aed1ca

Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>

Merge branch 'master' into rltrainer-is-all-you-need

5f8474f

fixes recreation of optimizers when add_module() is used

2dec0d7

Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>

fixed the recreation issue, both unittests pass now

9a201c8

Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>

a little clean up

9173668

Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>

wip

3dd8aa3

Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>

lint

f81f8a8

Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>

Merge branch 'rltrainer-is-all-you-need' into rltrainer-unittests

bb174c7

wip

e991204

Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>

Canceled auto-inference of optimizer class when add_module is called

3fc37be

Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>

moved the framework specific stuff from the baseclass to subclasses

2f9f392

Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>

Merge branch 'rltrainer-is-all-you-need' into rltrainer-unittests

a47b70c

Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>

Merge branch 'master' into rltrainer-is-all-you-need

9dce11f

Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>

removed rl_optimizer stuff

0afaf88

Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>

Merge branch 'rltrainer-is-all-you-need' into rltrainer-unittests

a61266a

added the isolated unittest

a10b79c

Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>

kouroshHakha requested review from sven1977 and gjoliver as code owners January 10, 2023 01:46

kouroshHakha requested review from avnishn, ArturNiederfahrenhorst, smorad, maxpumperla and krfricke as code owners January 10, 2023 01:46

kouroshHakha assigned avnishn Jan 10, 2023

kouroshHakha added 4 commits January 10, 2023 09:50

1. Made some private getter methods public
2. Added the unittest to the BUILD file

ca2aaaf

Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>

1. Made some private getter methods public
2. Added the unittest to the BUILD file

e609c71

Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>

Merge branch 'rltrainer-unittests' of github.com:kouroshHakha/ray int…

f14672f

…o rltrainer-unittests

Merge branch 'master' into rltrainer-unittests

b687339

Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>

kouroshHakha assigned gjoliver Jan 10, 2023

gjoliver approved these changes Jan 10, 2023

revert test_utils

b7edbe1

Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>

avnishn reviewed Jan 10, 2023

avnishn approved these changes Jan 10, 2023

gjoliver merged commit f4f36a9 into ray-project:master Jan 11, 2023
