[RLlib] Deflake replay buffer demo - Up test size to leave a little more time for ReplayBufferDemo #31429
Signed-off-by: Artur Niederfahrenhorst <artur@anyscale.com>
Do we really need 15 mins to run 10 iterations?
What is taking all this time? Should we just make an iteration shorter by setting min_time_s_per_iteration?
@gjoliver min_time_s_per_iteration is None by default, but we could lower the minimum samples per iteration from 1000 to something smaller. However, the script would then probably never learn, including on users' machines.
How long does it take to reach 50 now that we have 3 workers? Does it ever stop because of the 10-iter limit?
It usually stops because of the 10-iter limit.
Should be more than enough to make this green, I guess.
This is very cool!
Yeah, let's hope it has a super similar effect on CI! 🥳
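The stop conditions discussed in this thread (a reward of 50 or the 10-iteration limit, whichever comes first) can be sketched roughly as below. This is a minimal, library-free illustration and not the actual demo script: `run_until_done`, `train_step`, and the fake reward sequence are hypothetical names and made-up numbers introduced only for the example.

```python
from typing import Callable, Tuple


def run_until_done(
    train_step: Callable[[], float],
    target_reward: float = 50.0,  # reward threshold mentioned in the thread
    max_iterations: int = 10,     # iteration limit mentioned in the thread
) -> Tuple[int, float, str]:
    """Call train_step until the reward target or the iteration cap is hit.

    Returns (iterations_run, last_reward, reason_for_stopping).
    """
    reward = float("-inf")
    for iteration in range(1, max_iterations + 1):
        reward = train_step()  # hypothetical: one training iteration -> mean reward
        if reward >= target_reward:
            return iteration, reward, "reward_reached"
    # The loop ran out of iterations before the reward target was met.
    return max_iterations, reward, "iteration_limit"


# Made-up reward curve purely for illustration (not measured data).
fake_rewards = iter([12.0, 18.0, 25.0, 31.0, 36.0, 40.0, 43.0, 46.0, 48.0, 49.0])
result = run_until_done(lambda: next(fake_rewards))
print(result)
```

With a curve like this one, the run ends on the iteration cap rather than on the reward threshold, which is the case described in the reply above.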
…layBufferDemo (#31429) make medium again and add workers Signed-off-by: Artur Niederfahrenhorst <artur@anyscale.com>
…layBufferDemo (ray-project#31429) make medium again and add workers Signed-off-by: Artur Niederfahrenhorst <artur@anyscale.com> Signed-off-by: tmynn <hovhannes.tamoyan@gmail.com>
Why are these changes needed?
The replay buffer demo typically takes about 10 iterations to reach a reward of 50, but it often gets cancelled earlier (for example after 7 iterations, as in the picture) because it is marked as a test of size medium.
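To make the timing argument concrete, here is a small sketch of the budget arithmetic. It assumes the test sizes are Bazel test sizes with Bazel's default timeouts (60 s for small, 300 s for medium, 900 s for large, 3600 s for enormous). The per-iteration duration used below is a hypothetical figure, not a measurement from this PR, and `iterations_within_budget` is a name made up for the example.

```python
# Bazel's default timeout per test size, in seconds (assumed to apply here).
BAZEL_DEFAULT_TIMEOUT_S = {"small": 60, "medium": 300, "large": 900, "enormous": 3600}


def iterations_within_budget(
    size: str,
    seconds_per_iteration: float,
    startup_s: float = 0.0,
) -> int:
    """Return how many whole iterations fit before the test is timed out."""
    if seconds_per_iteration <= 0:
        raise ValueError("seconds_per_iteration must be positive")
    budget = BAZEL_DEFAULT_TIMEOUT_S[size] - startup_s
    if budget <= 0:
        return 0
    return int(budget // seconds_per_iteration)


# Hypothetical iteration time, chosen only to illustrate the arithmetic.
hypothetical_iter_s = 40.0
for size in ("medium", "large"):
    print(size, iterations_within_budget(size, hypothetical_iter_s))
```

Under these assumed numbers a medium test is cut off before 10 iterations complete, while a large test leaves room to spare. Faster iterations, for example from added workers, shift the same arithmetic in favour of the smaller size.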