[RLlib] Deflake replay buffer demo - Up test size to leave a little more time for ReplayBufferDemo #31429

ArturNiederfahrenhorst · 2023-01-04T11:28:29Z

Signed-off-by: Artur Niederfahrenhorst artur@anyscale.com

Why are these changes needed?

The replay buffer demo likes to take approx 10 iterations to reach the nice reward of 50 but often gets cancelled earlier (for example after 7 like in the picture) because it is marked as a test of size medium.

Signed-off-by: Artur Niederfahrenhorst <artur@anyscale.com>

gjoliver

need 15 mins to run 10 iterations??
what is taking all this time? should we just make an iteration shorter by setting min_time_s_per_iteration?

Signed-off-by: Artur Niederfahrenhorst <artur@anyscale.com>

ArturNiederfahrenhorst · 2023-01-04T18:28:26Z

@gjoliver min_time_s_per_iteration is None by default, but we could lower minimum samples per iteration from 1000 to something lower. However then the script probably would never learn, also not on user's machines.
I've given it three rolloutworkers now and I'm guessing its running much quicker locally. Should we try that?

gjoliver · 2023-01-04T19:18:22Z

@gjoliver min_time_s_per_iteration is None by default, but we could lower minimum samples per iteration from 1000 to something lower. However then the script probably would never learn, also not on user's machines. I've given it three rolloutworkers now and I'm guessing its running much quicker locally. Should we try that?

how long does it take to reach 50 now that we have 3 workers? does it ever stop because of 10-iter limit?

ArturNiederfahrenhorst · 2023-01-05T00:03:44Z

@gjoliver min_time_s_per_iteration is None by default, but we could lower minimum samples per iteration from 1000 to something lower. However then the script probably would never learn, also not on user's machines. I've given it three rolloutworkers now and I'm guessing its running much quicker locally. Should we try that?

how long does it take to reach 50 now that we have 3 workers? does it ever stop because of 10-iter limit?

It usually stops bc of the 10-iter limit.
On my own machine, execution time is halved:

ArturNiederfahrenhorst · 2023-01-05T00:04:05Z

Should be more than enough to make this green I guess

gjoliver · 2023-01-05T00:15:56Z

this is very cool!

ArturNiederfahrenhorst · 2023-01-05T00:16:58Z

Yeah let's hope it has a super similar effect on CI! 🥳

…layBufferDemo (#31429) make medium again and add workers Signed-off-by: Artur Niederfahrenhorst <artur@anyscale.com>

…layBufferDemo (ray-project#31429) make medium again and add workers Signed-off-by: Artur Niederfahrenhorst <artur@anyscale.com> Signed-off-by: tmynn <hovhannes.tamoyan@gmail.com>

up test size to leave a little more time

14d4722

Signed-off-by: Artur Niederfahrenhorst <artur@anyscale.com>

ArturNiederfahrenhorst requested review from sven1977 and gjoliver as code owners January 4, 2023 11:28

ArturNiederfahrenhorst assigned sven1977 Jan 4, 2023

ArturNiederfahrenhorst requested review from avnishn, smorad, maxpumperla, kouroshHakha and krfricke as code owners January 4, 2023 11:28

krfricke approved these changes Jan 4, 2023

View reviewed changes

Merge branch 'master' into replaybufferflakeyness

dd4ff49

gjoliver reviewed Jan 4, 2023

View reviewed changes

make medium again and add workers

32c8af3

Signed-off-by: Artur Niederfahrenhorst <artur@anyscale.com>

gjoliver merged commit a96d5de into ray-project:master Jan 5, 2023

ArturNiederfahrenhorst deleted the replaybufferflakeyness branch January 5, 2023 15:39

AmeerHajAli pushed a commit that referenced this pull request Jan 12, 2023

[RLlib] Deflake replay buffer demo - Use more rollout workers for Rep…

daa1329

…layBufferDemo (#31429) make medium again and add workers Signed-off-by: Artur Niederfahrenhorst <artur@anyscale.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[RLlib] Deflake replay buffer demo - Up test size to leave a little more time for ReplayBufferDemo #31429

[RLlib] Deflake replay buffer demo - Up test size to leave a little more time for ReplayBufferDemo #31429

ArturNiederfahrenhorst commented Jan 4, 2023 •

edited

Loading

gjoliver left a comment

ArturNiederfahrenhorst commented Jan 4, 2023

gjoliver commented Jan 4, 2023

ArturNiederfahrenhorst commented Jan 5, 2023

ArturNiederfahrenhorst commented Jan 5, 2023

gjoliver commented Jan 5, 2023

ArturNiederfahrenhorst commented Jan 5, 2023

[RLlib] Deflake replay buffer demo - Up test size to leave a little more time for ReplayBufferDemo #31429

[RLlib] Deflake replay buffer demo - Up test size to leave a little more time for ReplayBufferDemo #31429

Conversation

ArturNiederfahrenhorst commented Jan 4, 2023 • edited Loading

Why are these changes needed?

gjoliver left a comment

Choose a reason for hiding this comment

ArturNiederfahrenhorst commented Jan 4, 2023

gjoliver commented Jan 4, 2023

ArturNiederfahrenhorst commented Jan 5, 2023

ArturNiederfahrenhorst commented Jan 5, 2023

gjoliver commented Jan 5, 2023

ArturNiederfahrenhorst commented Jan 5, 2023

ArturNiederfahrenhorst commented Jan 4, 2023 •

edited

Loading