Jmm/one buffer #494

Merged: 6 commits merged into hackathon_integration on Apr 27, 2021
Conversation

@Yurlungur (Collaborator) commented Apr 21, 2021

PR Summary

Built on the discussions in the hackathon and on @pgrete's unification of communication buffers in #493. This combines the flux and coarse cell buffers into a single Kokkos view. A couple of things to note:

  • This reduces the number of allocations per variable per meshblock by 3.
  • This relies on LayoutRight in our Kokkos views (see the sketch below).
  • This wastes some memory: the coarse buffer is now allocated at the same size as the dense buffer.
  • This is built on "Use subviews for buffers" (#493) and should go in after it.

I haven't done any profiling yet.
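
For readers unfamiliar with the pattern, here is a minimal, stand-alone sketch of the approach described above: pack several same-shaped buffers into one LayoutRight allocation and hand out per-buffer subviews. All names and extents (pack_, n_buffers, N1..N3) are illustrative only, not the actual Parthenon code.

#include <Kokkos_Core.hpp>

int main(int argc, char **argv) {
  Kokkos::initialize(argc, argv);
  {
    const int n_buffers = 4;  // e.g. 3 flux buffers plus 1 coarse buffer
    const int N1 = 16, N2 = 16, N3 = 16;

    // One allocation instead of n_buffers separate ones.
    Kokkos::View<double ****, Kokkos::LayoutRight> pack_("pack", n_buffers, N1, N2, N3);

    // Fixing the leftmost index of a LayoutRight view yields a contiguous,
    // LayoutRight subview, so each slot behaves like its own array.
    auto flux_x = Kokkos::subview(pack_, 0, Kokkos::ALL(), Kokkos::ALL(), Kokkos::ALL());
    auto coarse = Kokkos::subview(pack_, 3, Kokkos::ALL(), Kokkos::ALL(), Kokkos::ALL());

    Kokkos::deep_copy(flux_x, 0.0);  // the subviews are used like ordinary views
    Kokkos::deep_copy(coarse, 1.0);
  }
  Kokkos::finalize();
  return 0;
}

This is also why the summary flags the memory overhead: every slot in the packed view, including the coarse one, gets the full dense extents.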

PR Checklist

  • Code passes cpplint
  • New features are documented.
  • Adds a test for any bugs fixed. Adds tests for new features.
  • Code is formatted
  • Changes are summarized in CHANGELOG.md
  • (@lanl.gov employees) Update copyright on changed files

@forrestglines (Collaborator) left a comment:

I'm curious to see the fix to the unit test, but once that's in we should merge.

Comment on lines +120 to +123
// This wastes about 1/2 a meshblock in memory
coarse_s = ParArrayND<T>(Kokkos::subview(comm_data_, offset++, Kokkos::ALL(),
                                         Kokkos::ALL(), Kokkos::ALL(), Kokkos::ALL(),
                                         Kokkos::ALL(), Kokkos::ALL()));
Collaborator:
Getting around the wasted memory would probably mean allocating the fluxes and this coarse data from one long array, right? However, I don't think Kokkos supports changing layouts like that with a subview.

@Yurlungur (Collaborator, Author):
Yeah, I tried that, actually. Kokkos complained because the types don't match.
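
For context, a hedged sketch of the alternative discussed here, not what this PR does: carving differently-shaped buffers out of one flat allocation with unmanaged views. Kokkos::subview cannot reshape a rank-1 allocation into higher-rank views with arbitrary extents, which is where the type errors come from. All names and extents (carve, Flat, Buf3D, nf*, nc*) are made up for illustration.

#include <cstddef>
#include <Kokkos_Core.hpp>

using Flat = Kokkos::View<double *, Kokkos::LayoutRight>;
using Buf3D = Kokkos::View<double ***, Kokkos::LayoutRight,
                           Kokkos::MemoryTraits<Kokkos::Unmanaged>>;

void carve(const Flat &flat) {
  // Hypothetical extents: a flux-sized buffer and a smaller coarse-sized buffer.
  const int nf1 = 16, nf2 = 16, nf3 = 17;
  const int nc1 = 8, nc2 = 8, nc3 = 8;
  std::size_t offset = 0;

  // Wrap raw pointer offsets into the flat allocation; nothing is copied or allocated.
  Buf3D flux(flat.data() + offset, nf1, nf2, nf3);
  offset += std::size_t(nf1) * nf2 * nf3;
  Buf3D coarse(flat.data() + offset, nc1, nc2, nc3);

  Kokkos::deep_copy(flux, 0.0);
  Kokkos::deep_copy(coarse, 0.0);
  // The caller must allocate `flat` large enough and track offsets by hand,
  // bookkeeping that the equal-sized-slot subview approach avoids.
}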

}
n_outer += (pmb->pmy_mesh->multilevel);
comm_data_ = ParArray7D<T>(base_name + ".comm_data", n_outer, GetDim(6), GetDim(5),
Collaborator:
Is comm_data_ only the collection of 4 arrays: the 3 fluxes and the coarse array? Does the benefit of combining these allocations outweigh the cost of the wasted memory for coarse_s? Could coarse_s remain its own array?

Collaborator:
This is less relevant for this PR, but is allocating space for all the fluxes necessary? Could the flux divergence be computed in each dimension one by one so that only one flux array is needed in memory? If possible, this would allow larger problem sizes but might be too big a code change.

@Yurlungur (Collaborator, Author):
> Is comm_data_ only the collection of 4 arrays: the 3 fluxes and the coarse array? Does the benefit of combining these allocations outweigh the cost of the wasted memory for coarse_s? Could coarse_s remain its own array?

That could be done. I'm not sure what the right trade-off is. I would say we accept the memory overhead for now, and we can split out the coarse buffer later if we desire.

> This is less relevant for this PR, but is allocating space for all the fluxes necessary? Could the flux divergence be computed in each dimension one by one so that only one flux array is needed in memory? If possible, this would allow larger problem sizes but might be too big a code change.

This is an interesting idea. I agree not for this PR, but something to think about.
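
To make the idea concrete, here is a rough sketch of computing the flux divergence one direction at a time so that only a single scratch flux array is ever resident. The interface (Field, add_flux_divergence, compute_flux, dx, scratch_flux) is hypothetical, not existing Parthenon API; scratch_flux is assumed to be at least as large as u in every direction.

#include <string>
#include <Kokkos_Core.hpp>

using Field = Kokkos::View<double ***, Kokkos::LayoutRight>;

// Accumulate -div(F) into dudt, reusing one scratch flux array for all three directions.
template <class FluxFunc>
void add_flux_divergence(const Field &u, const Field &dudt, const Field &scratch_flux,
                         const double dx[3], FluxFunc compute_flux) {
  const int n1 = u.extent_int(0), n2 = u.extent_int(1), n3 = u.extent_int(2);
  for (int d = 0; d < 3; ++d) {
    compute_flux(d, u, scratch_flux);  // user callback fills fluxes for direction d
    const double idx = 1.0 / dx[d];    // capture a scalar, not the host array dx
    const int ip = (d == 0), jp = (d == 1), kp = (d == 2);
    Kokkos::parallel_for(
        "flux_div_" + std::to_string(d),
        Kokkos::MDRangePolicy<Kokkos::Rank<3>>({1, 1, 1}, {n1 - 1, n2 - 1, n3 - 1}),
        KOKKOS_LAMBDA(const int i, const int j, const int k) {
          dudt(i, j, k) -=
              idx * (scratch_flux(i + ip, j + jp, k + kp) - scratch_flux(i, j, k));
        });
  }
  Kokkos::fence();
}

The trade is memory for serialization: each direction's flux must be fully computed before its divergence is accumulated, so the three directions can no longer be processed independently.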

@pgrete merged commit 0607afc into hackathon_integration on Apr 27, 2021