
Request: Speed up the uniform mesh converter #991

Closed
keckler opened this issue Nov 21, 2022 · 9 comments · Fixed by #1042
Labels
feature request Smaller user request optimization related to measuring and speeding up the code or reducing memory

Comments

@keckler
Member

keckler commented Nov 21, 2022

The uniform mesh converter can sometimes take a very long time. However, the majority of what it does is loop over all the assemblies in the core and assign parameters back and forth between blocks.

Calls to setAssemblyStateFromOverlaps are all completely independent and could benefit from parallel execution. Based on looking at the uniform mesh generator, this should cover the majority of the work in that class and I think could speed up the conversion process a ton.
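A rough sketch of what that proposal could look like, not ARMI's actual API: since each setAssemblyStateFromOverlaps call is independent, the per-assembly work could be farmed out to a process pool. The `map_assembly_state` helper and the `(source, dest)` pair shape are illustrative stand-ins.

```python
from concurrent.futures import ProcessPoolExecutor

def map_assembly_state(pair):
    """Stand-in for setAssemblyStateFromOverlaps: map source state onto dest."""
    source, dest = pair
    # ...copy block parameters from source onto dest here...
    return dest

def convert_all(assembly_pairs, executor_cls=ProcessPoolExecutor, max_workers=4):
    """Run the independent per-assembly mappings concurrently."""
    with executor_cls(max_workers=max_workers) as pool:
        return list(pool.map(map_assembly_state, assembly_pairs))
```

The `executor_cls` hook lets you swap in a thread pool for testing; as discussed below, whether process-level parallelism is even viable here depends on whether this code already runs inside an MPI-parallel region.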

@john-science john-science added the feature request Smaller user request label Nov 22, 2022
@ntouran
Member

ntouran commented Nov 22, 2022

Can we at least run the operation through the profiler first to see exactly where the slow bottlenecks are, with precise instrumentation? Then we can see if there's any super low-hanging fruit before parallelizing.

@ntouran ntouran added the optimization related to measuring and speeding up the code or reducing memory label Nov 22, 2022
@keckler
Member Author

keckler commented Nov 22, 2022

Yes, a very good idea! This was just based on the keckler-profiler, aka my eyeballs looking at the exploding runtimes.

@jakehader
Member

jakehader commented Nov 22, 2022

> Can we at least run the operation through the profiler first to see exactly where the slow bottlenecks are, with precise instrumentation? Then we can see if there's any super low-hanging fruit before parallelizing.

I did this months ago but didn't seem to write it down. It is slow on getting and setting number densities for each block in the core. I feel like this suggestion keeps getting made. @john-science did timing tests of this a couple months ago too.

See: #721

@keckler
Member Author

keckler commented Nov 28, 2022

It looks like the uniform mesh converter may be getting a proper overhaul in the near future, so if this is implemented, it should be a part of that larger work.

@john-science
Member

john-science commented Dec 5, 2022

A good process for speeding up code goes something like:

  1. Profile the code.
  2. Review the code profile and see where the slow-downs are and what expensive operations are being called inside nested loops that can easily be improved.
  3. See what data structures are being used.
  4. See if we can lean more on NumPy versus pure Python.
  5. and on and on and on...
  6. Finally, and Very Last: parallelizing the code.

It is not appropriate to say "we need to parallelize this code" before you've done any work to improve its performance.
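For step 1, the standard-library profiler is usually enough. A minimal sketch; `profile_call` is a hypothetical helper, and you'd pass it the actual conversion entry point:

```python
import cProfile
import io
import pstats

def profile_call(func, *args, **kwargs):
    """Run func under cProfile and return (result, stats report text)."""
    profiler = cProfile.Profile()
    profiler.enable()
    result = func(*args, **kwargs)
    profiler.disable()
    stream = io.StringIO()
    stats = pstats.Stats(profiler, stream=stream).sort_stats("cumulative")
    stats.print_stats(20)  # top 20 entries is usually enough to spot hot spots
    return result, stream.getvalue()
```

Sorting by cumulative time surfaces the functions that dominate the call tree, which is exactly what steps 2 and 3 need.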

@john-science
Member

Also, and not for nothing, you want to parallelize the uniform mesh converter? Even though the ARMI model is already run in parallel?

You can't spawn parallel code INSIDE code that's already parallel.

Is the uniform mesh converter run outside the already parallel code in the ARMI interface loop?

@mgjarrett
Contributor

mgjarrett commented Dec 19, 2022

I'm still experimenting with the profiler, but here are some preliminary results. I ran a script that loads up a reactor, does a uniform mesh conversion and then de-conversion. 50% of the overall run time is spent in makeAssemWithUniformMesh, with most of that being spent in this block.deepcopy():

[screenshot: profiler output showing makeAssemWithUniformMesh dominated by deepcopy]

I don't think we really need a deepcopy here. We just want a new block that has the same name as the original; we're going to overwrite everything else (height, number densities, parameters, etc.). We could probably cut a good chunk of this overhead by avoiding the deepcopy.

Note: This was a small test reactor; the proportion of run time in each function call might change when we run on a full-size model.
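A toy illustration of the idea, assuming a simplified `Block` class rather than ARMI's real one: construct a fresh block carrying over only the cheap identifying state, instead of deep-copying the whole object graph that we're about to overwrite anyway.

```python
import copy

class Block:
    """Simplified stand-in for an ARMI block."""
    def __init__(self, name):
        self.name = name
        self.height = 0.0
        self.number_densities = {}  # overwritten by the uniform mesh mapping
        self.params = {}            # overwritten by the uniform mesh mapping

def make_uniform_block_deepcopy(source):
    """Old approach: copies every nested attribute, even ones we discard."""
    return copy.deepcopy(source)

def make_uniform_block_light(source):
    """Lighter approach: fresh block sharing only the name with the source."""
    return Block(source.name)
```

The deepcopy version scales with the size of the block's entire state; the light version is constant-time per block.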

@mgjarrett
Contributor

I implemented a method for instantiating a new block that's much lighter than a deepcopy. This reduced the time spent on creating new blocks by an order of magnitude, and now the long pole in the tent is parameterDefinitions.__getitem__.

In a test case where makeAssemWithUniformMesh is called 155 times (for 155 assemblies), parameterDefinitions.__getitem__ is called 2,523,029 times.

[screenshot: profiler output showing parameterDefinitions.__getitem__ call counts]

The expensive line is this one, which walks through all of the parameters defined in the collection and performs a string comparison against the name passed into the function:

```python
matches = [pd for pd in self if pd.name == name]
```

I think we can cut run time down by at least an order of magnitude if this can be converted to a hashed lookup. I opened #1039 to track developments on that front.
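A sketch of the hashed-lookup idea against a toy stand-in for the parameter collection (the class and field names here are illustrative, not ARMI's real ones): build a name-to-definition dict once, then `__getitem__` is a single O(1) lookup instead of an O(n) scan per call.

```python
from collections import namedtuple

ParamDef = namedtuple("ParamDef", ["name", "units"])

class ParamCollection:
    """Toy parameter collection with a hashed name index."""

    def __init__(self, defs):
        self._defs = list(defs)
        # Build the name -> definition index once, up front.
        self._by_name = {pd.name: pd for pd in self._defs}

    def __getitem__(self, name):
        # O(1) dict lookup replaces the per-call linear scan:
        #   matches = [pd for pd in self if pd.name == name]
        return self._by_name[name]

    def __iter__(self):
        return iter(self._defs)
```

With ~2.5 million `__getitem__` calls in the test case above, turning each call from a scan over every definition into one dict lookup is where the order-of-magnitude estimate comes from.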

@mgjarrett mgjarrett mentioned this issue Dec 20, 2022
@keckler keckler changed the title Request: Implement the uniform mesh converter in parallel Request: Speed up the uniform mesh converter Dec 22, 2022
@john-science
Member

john-science commented Dec 27, 2022

Great progress @mgjarrett !

Removing a deepcopy() is usually a win. In most languages, doing a "deep copy" of an object is an expensive operation. So, that makes good sense.

Also, I like that you found one line that was costing a huge portion of the run time. You'd be surprised how often that's true.

You're burying the lede, though. I've always heard that the number densities were the slowest part of ARMI, specifically the meshing. So... you did an actual profile and found otherwise. That's great! Way to use analytical data instead of just trusting the standard wisdom. You rock.

@keckler keckler linked a pull request Jan 13, 2023 that will close this issue