Materialization of DFT field data on the master only #1797

ahoenselaar · 2021-10-22T00:29:48Z

DFT fields returned to the user are gathered in full on each process. This can result in massive memory allocations, especially when multiple Meep processes are running on the same physical node.

In many scenarios, the DFT field data is not needed on each worker/process but only on the master. Being able to materialize DFT fields on the master only could alleviate many out-of-memory situations.

stevengj · 2021-10-23T00:41:01Z

(As much as possible, it would be nice to avoid collecting the DFT field data on any single process—which ultimately won't scale, because all the DFT fields will overwhelm the memory available on any single process if the problem gets big enough—but rather to leave it distributed and to compute with it in that form, e.g. as we do for the near-to-far transformation.)

stevengj · 2021-11-03T01:37:26Z

In particular, the adjoint solver should never call get_dft — it should compute a distributed dot product of the forward and adjoint fields.

stevengj · 2021-11-10T03:40:00Z

The outline I have in mind is:

Distribute the computation of the "dot product" between the forward and adjoint fields that gives the gradient. This way, you won't need to collect the DFT fields on any process. However, the degrees of freedom and the gradients (just 2d arrays for 2d material grids) will still be replicated — these are much smaller than the DFT fields for 2d material grids, though!
In the long run, one could have a distributed version of the CCSA algorithm, so that each process only stores a portion of the degrees of freedom (e.g. you break the material grid into "chunks" according to process boundaries, and chunks that are not needed are not stored locally) and the CCSA algorithm operates in parallel on distributed data. This way, from the user perspective it will be the same as now — you just write one "serial" Meep script that happens to run in parallel — but it will scale better to huge problems (e.g. volumetric degrees of freedom for 3d printing).

oskooi added the enhancement label Oct 22, 2021

smartalecH mentioned this issue Nov 19, 2021

Non-materialized DFT field storage for streamlined adjoint calculations #1832

Closed

oskooi mentioned this issue Nov 20, 2021

support for single-precision floating point for fields array functions #1833

Merged

smartalecH mentioned this issue Apr 13, 2022

Non-materialized dft fields for adjoint calculations #1855

Merged

4 tasks

stevengj closed this as completed in #1855 Apr 14, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Materialization of DFT field data on the master only #1797

Materialization of DFT field data on the master only #1797

ahoenselaar commented Oct 22, 2021

stevengj commented Oct 23, 2021 •

edited

Loading

stevengj commented Nov 3, 2021

stevengj commented Nov 10, 2021 •

edited

Loading

Materialization of DFT field data on the master only #1797

Materialization of DFT field data on the master only #1797

Comments

ahoenselaar commented Oct 22, 2021

stevengj commented Oct 23, 2021 • edited Loading

stevengj commented Nov 3, 2021

stevengj commented Nov 10, 2021 • edited Loading

stevengj commented Oct 23, 2021 •

edited

Loading

stevengj commented Nov 10, 2021 •

edited

Loading