-
Notifications
You must be signed in to change notification settings - Fork 23
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Agent task spawning performance #341
Comments
Thanks! This looks similar to the data Andre presented, right? Anyway, full-utilisation is not an absolute goal, we need to get a better intuition of what we would consider acceptable performance for the application workloads that we want to support. Startup time can be improved, I think that is the general consensus. I guess we can use this ticket to discuss some possible improvements, but as said, I would also like to collect reasonable expectations from an application perspective. On 10 Sep 2014, at 9:42 , mturilli notifications@github.com wrote:
|
Mark, agree with all. The purpose of the ticket is indeed to collect data, and to discuss the target line... Thanks! |
Agreed on all. Shall we keep a list somewhere of the user communities and type/sample of workloads that we already/need to support? I believe having something we could run ourselves would be very valuable for testing and profiling. |
We had an interesting conversation about this ticket with Shantenu. Here a brief summary of the topic and ideas discussed:
|
I certainly agree with the above. Can we (as in RP) ask/task you (as in MD-folx) to take care of this, i.e. to gather those use cases and to derive explicit qualitative and quantitative RP requirements? |
ping to team MD to reply. |
ping to team MD to reply. Otherwise the ticket can probably go, as the pilot start time (the specific ticket topic) has mostly been taken care of, and performance / utilization is sufficiently addressed / discussed in other contexts. |
ping to team MD to reply. This is the final call before the ticket is closed. We'll raise the topic during the next pilot call, too. |
obsolete and superseded... |
Evidence for the need to optimize task time startup:
The diagram shows 128 short tasks run by one out of 2 available pilots. Note that the pilot never reaches full utilization. A possible interpretation is: "the pilot manages to start ~40 CUs, then the first one seem to die off and the pilot is not keeping up with unit spawning". If confirmed, this is evidence for the need to optimize the time of task spawning within the agent.
The text was updated successfully, but these errors were encountered: