Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

A variety of changes #108

Merged
merged 3 commits into from
Sep 1, 2024
Merged

Conversation

pyprogrammer
Copy link
Contributor

Hi Xudong, I've been using May for some of our simulation research, and had a few changes I'd like to make to May that I think would be generically helpful. I'd also be happy to split these out into separate pull requests if it'd make it easier for you, but the overall changes are quite short.

  1. Made timeout a config option
  2. Made core pinning a config option
  3. Added spawn_builder to scopes
  4. Switched to using Crossbeam for work stealing
  5. Rewrote work-stealing scheduler to be cleaner. Also added randomized work-stealing instead of always checking the next few.

1. Made timeout a config option
2. Made core pinning a config option
3. Added spawn_builder to scopes
4. Switched to using Crossbeam for work stealing
5. Rewrote work-stealing scheduler to be cleaner.
- 5.1: Uses randomized work stealing instead of always checking the next few.
@Xudong-Huang
Copy link
Owner

thanks for the changes!
I was using crossbeam for work stealing before, but found it some slow, so I use my own may_queue to do it.
Have you ever benchmark the performance changes?

Also add random work stealing would impact the performance, but I didn't have the numbers.

Other changes looks good to me.

@pyprogrammer
Copy link
Contributor Author

I'll revisit the perf impact between Crossbeam and may_queue -- I remember seeing small performance improvements but I'll rerun them and gather the info.

Re: Randomized work-stealing: the existing system only scans the next few queues before exiting the loop (which blocks at the next epoll).
This can cause many workers to go to sleep if at any point in the program only a few coroutines are unblocked. The adjacent workers can steal, which in turn allows the worker before that to steal, etc. This in turn would cause a steal-chain to form. Randomized work stealing pays a small overhead for the RNG, but can better avoid such pathological conditions.

This could also be a candidate for potentially turning into a configuration or a feature flag; long-lived coroutines which frequently wait will likely prefer randomized work-stealing, while short-lived coroutines may prefer the lower overhead of the current system. I implemented randomized work-stealing because my applications were running into the pathological conditions mentioned above, and Go seems to have had success with a randomized work-stealer.

@Xudong-Huang
Copy link
Owner

Xudong-Huang commented Apr 6, 2024

I like the feature gate to do random work stealing. Also, let's keep using the may_queue. If you really need crossbeam for work stealing, I also like a feature gate for that.

@pyprogrammer
Copy link
Contributor Author

Hi Xudong, just checking in to let you know that I'm still working on this. I've been swamped by camera-ready and artifact evaluation for the tool we built on top of May.

rand_work_steal enables randomized work stealing, using fastrand

crossbeam_queue_steal uses crossbeam's queue implementation instead of may's implementation.
Functionally speaking, this causes the steal to attempt to steal 50% instead of just the next block.
@pyprogrammer
Copy link
Contributor Author

pyprogrammer commented May 3, 2024

Sorry for the delay.

I've implemented the crossbeam-queue and randomized work-stealing as two separate features, and run the four feature flag combinations on my local machine. Based on these results, it appears that rand-steal vs. plain work-steal is generally a wash, while crossbeam-queue is worse than the current implementation.

Regarding crossbeam-queue, would you prefer merging it in gated behind the feature flag, or would you prefer removing it altogether?

crossbeam.txt
crossbeam_and_rand.txt
rand_steal.txt
current.txt

@pyprogrammer
Copy link
Contributor Author

@Xudong-Huang Hi Xudong, have you had time to take a look recently?

@pyprogrammer
Copy link
Contributor Author

@Xudong-Huang Hi Xudong, checking in. Do you have time to take a look at the PR?

@Xudong-Huang
Copy link
Owner

thanks for this change

@Xudong-Huang Xudong-Huang merged commit 7cf4a4e into Xudong-Huang:master Sep 1, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants