-
Notifications
You must be signed in to change notification settings - Fork 9
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Memory and Core estimate function #263
Comments
The branch of this issue has a heuristic for memory estimation that seems to work relatively well but I'd still like to test it with larger partitions (going to the 100s of GBs). There are some unexpected (to me at least) memory consumers, like |
|
Here is a summary of estimation results for the branch of this ticket up to ~2024.11.12 (not latest, that will come in a follow-up comment). The memory estimates produced by the function added here seem already relatively accurate and safe (never underestimating), but could overestimate up to 5-10% (presumably more for even larger partitions than considered here).
(in the table, by default the default partition scheme is used, unless otherwise stated in the notes). The only exception to the general rule of "never underestimate" are datasets where the pointing_xds is significant/similar order or comparable in size to the main_xds. Machines / environments:
Things to keep an eye on:
|
Here is an updated and slightly extended table with the estimates after the improvements in the last few commits (after 2024.11.13):
This is now accounting for the memory used in This should be producing a "safe" level of overestimation, which I think can be further reduced once we account for the other sub-xdss that can be relevant at the level of a ~1% or so for medium-large datasets (and much more for tiny/demo/test datasets), like the pointing_xds and the syscal_xds. I'd suggest we open an issue to add estimates of those sub-xdss, to follow up on this issue/branch. |
Add a function to convert_msv2_to_processing_set.py that estimates the amount of memory per core required to convert an MSv2 -> PS(MSv4) and gives the maximum number of cores and a suggested number of cores.
The text was updated successfully, but these errors were encountered: