JP-2037 and JP-1926: Fixing ramp fitting multiprocessing to work with new STCAL interface. #30

kmacdonald-stsci · 2021-06-15T18:05:38Z

Ramp fitting multiprocessing is now working as it did in JWST 1.1.0 with the optional results bug detailed in JP-1926. For JP-1926, the optional results product varies in size depending on the maximum number of segments. When sliced results from each process was being reassembled into the final optional results product returned from ramp fitting, the maximum sizes per slice may be different, so this needed to be accounted for.

All unit tests pass on JWST.

dmggh

In addition to these, I may have a few other comments.

dmggh · 2021-06-16T00:41:52Z

src/stcal/ramp_fitting/ols_fit.py

-    new_model.err : ndarray
-        The output total variance for each pixel, 2-D float32
+    rows_per_slice: list
+        The number of rows in each row.


Should that be "The number of rows in each slice" ?

Yes. Thank you for catching that.

dmggh · 2021-06-16T00:44:26Z

src/stcal/ramp_fitting/ols_fit.py

-    new_model.var_rnoise : ndarray
-        The variance in each pixel due to read noise, 2-D float32
+    pool_results: list
+        The list of return values from ols_ramp_fit_single for each slice.


Perhaps state what these values are in the list.

I expanded this docstring.

dmggh · 2021-06-16T00:58:35Z

src/stcal/ramp_fitting/ols_fit.py

+        group_time=ramp_data.group_time,
+        groupgap=ramp_data.groupgap,
+        nframes=ramp_data.nframes,
+        drop_frames1=ramp_data.drop_frames1)


Isn't this meta data for same for each slice ? If so perhaps it can be moved outside the loop this function is within.

This meta data is needed by each process in multiprocessing, therefore each RampData class needs this information.

dmggh · 2021-06-16T01:05:48Z

src/stcal/ramp_fitting/ols_fit.py

-        return out_model, int_model, opt_model
-
-
-def create_output_models(input_model, number_of_integrations, save_opt,


Has create_output_models( ) been replaced by another function ?

No. There are no models being returned from ramp fitting anymore. Ramp fitting returns tuples of arrays that the step code in each pipeline will now use to create the desired data model specific to that pipeline.

For multiprocessing, however, output arrays need to be created that are assembled from the sliced return values. The function create_output_info does this in assemble_pool_results.

dmggh · 2021-06-16T01:14:32Z

src/stcal/ramp_fitting/ols_fit.py

+        if save_opt:
+            get_opt_slice(opt_info, opt_slice, current_row_start, nrows)
+        current_row_start = current_row_start + nrows
+


Does get_integ_slice( ) assemble the output info from the slices whose output info was assembled from each image by get_image_slice( ) ? Maybe add a bit more documentation to clarify what these functions do.

No. The sliced (by row) return values are returned from starmap where each sliced input is used as parameters for ols_ramp_fit_single. These sliced return values are used by the get_xxxx_slice() functions to assemble each slice into a single xxxx product to be returned from ramp fitting.

jdavies-st

This looks good!

jdavies-st · 2021-06-17T14:29:17Z

src/stcal/ramp_fitting/ols_fit.py

+    slope[:, :oslope.shape[1], srow:erow, :] = oslope
+    sigslope[:, :osigslope.shape[1], srow:erow, :] = osigslope
+    var_poisson[:, :ovar_poisson.shape[1], srow:erow, :] = ovar_poisson
+    var_rnoise[:, :ovar_rnoise.shape[1], srow:erow, :] = ovar_rnoise
+    yint[:, :oyint.shape[1], srow:erow, :] = oyint
+    sigyint[:, :osigyint.shape[1], srow:erow, :] = osigyint
+    weights[:, :oweights.shape[1], srow:erow, :] = oweights
+    crmag[:, :ocrmag.shape[1], srow:erow, :] = ocrmag


I really like the use of mnemonics here. Much easier to read code. What's the significance of shape[1]? Is that number of integrations?

The significance of shape[1] is that it is of variable size. All but the crmag depend on the maximum number of segments in each ramp. The crmag size depends on the maximum number of cosmic rays. Since it's possible for each slice to have different maximum sizes, the full optional results arrays will be set to the maximum across all slices. Since it's possible for the different slices to be of different sizes simple assignment can throw an error trying to assign arrays of different sizes.

For example, it is not possible to do

crmag[:, :, srow:erow, :] = ocrmag

if crmag[:, :, srow:erow, :].shape is (1, 5, 256, 2048), while ocrmag.shape is (1, 2, 256, 2048).

The shapes must have the same dimensions. By construction the first, third, and fourth dimensions will be the same. The second dimension of crmag will always be at least as large as the second dimension of ocrmag, but it may be larger. To ensure that this dimension is the same, it needs to be explicitly applied during assignment.

…=nd.float32, but should be dtype=np.uint32. Changed to allocate an array with the correct dtype.

codecov · 2021-07-09T17:50:30Z

Codecov Report

❗ No coverage uploaded for pull request base (main@686a1bb). Click here to learn what that means.
The diff coverage is n/a.

@@           Coverage Diff           @@
##             main      #30   +/-   ##
=======================================
  Coverage        ?   10.30%           
=======================================
  Files           ?       13           
  Lines           ?     1563           
  Branches        ?        0           
=======================================
  Hits            ?      161           
  Misses          ?     1402           
  Partials        ?        0

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 686a1bb...3604a44. Read the comment docs.

nden · 2021-07-28T13:39:58Z

@kmacdonald-stsci This needs a change log entry and it's ready to be merged.

dmggh · 2021-07-29T07:33:47Z

I realize this PR has already been merged; nevertheless LGTM.

Fixing ramp fitting multiprocessing to work with new STCAL interface.

f7210fa

kmacdonald-stsci requested review from jdavies-st, dmggh and nden June 15, 2021 18:25

dmggh reviewed Jun 16, 2021

View reviewed changes

Making changes based on code review.

22d259c

jdavies-st previously approved these changes Jun 17, 2021

View reviewed changes

jdavies-st added the ramp_fitting label Jun 17, 2021

This was referenced Jun 21, 2021

Ramp fit fails with multiprocessing spacetelescope/jwst#5756

Closed

Fix ramp fitting multiprocessing. spacetelescope/jwst#5945

Closed

Final DQ array for multiprocessing was incorrectly allocated as dtype…

9935c7a

…=nd.float32, but should be dtype=np.uint32. Changed to allocate an array with the correct dtype.

kmacdonald-stsci dismissed jdavies-st’s stale review via 9935c7a July 9, 2021 17:49

Changing DQ to be uint32, not float32.

ded9439

kmacdonald-stsci requested a review from hbushouse July 27, 2021 15:23

hbushouse previously approved these changes Jul 27, 2021

View reviewed changes

Adding addition of multiprocessing to change log.

3604a44

kmacdonald-stsci dismissed hbushouse’s stale review via 3604a44 July 28, 2021 13:44

nden approved these changes Jul 28, 2021

View reviewed changes

nden merged commit 8cf81e0 into spacetelescope:main Jul 28, 2021

kmacdonald-stsci deleted the jp_2037_multi_02 branch July 29, 2021 20:41

kmacdonald-stsci mentioned this pull request Aug 5, 2021

Changing ramp fitting multiprocessing tests spacetelescope/jwst#6254

Merged

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

JP-2037 and JP-1926: Fixing ramp fitting multiprocessing to work with new STCAL interface. #30

JP-2037 and JP-1926: Fixing ramp fitting multiprocessing to work with new STCAL interface. #30

kmacdonald-stsci commented Jun 15, 2021 •

edited

Loading

dmggh left a comment

dmggh Jun 16, 2021

kmacdonald-stsci Jun 16, 2021

dmggh Jun 16, 2021

kmacdonald-stsci Jun 16, 2021

dmggh Jun 16, 2021

kmacdonald-stsci Jun 16, 2021

dmggh Jun 16, 2021

kmacdonald-stsci Jun 16, 2021 •

edited

Loading

dmggh Jun 16, 2021

kmacdonald-stsci Jun 16, 2021

jdavies-st left a comment

jdavies-st Jun 17, 2021

kmacdonald-stsci Jun 17, 2021 •

edited

Loading

codecov bot commented Jul 9, 2021 •

edited

Loading

nden commented Jul 28, 2021

dmggh commented Jul 29, 2021

		return out_model, int_model, opt_model


		def create_output_models(input_model, number_of_integrations, save_opt,

JP-2037 and JP-1926: Fixing ramp fitting multiprocessing to work with new STCAL interface. #30

JP-2037 and JP-1926: Fixing ramp fitting multiprocessing to work with new STCAL interface. #30

Conversation

kmacdonald-stsci commented Jun 15, 2021 • edited Loading

dmggh left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

kmacdonald-stsci Jun 16, 2021 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jdavies-st left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

kmacdonald-stsci Jun 17, 2021 • edited Loading

Choose a reason for hiding this comment

codecov bot commented Jul 9, 2021 • edited Loading

Codecov Report

nden commented Jul 28, 2021

dmggh commented Jul 29, 2021

kmacdonald-stsci commented Jun 15, 2021 •

edited

Loading

kmacdonald-stsci Jun 16, 2021 •

edited

Loading

kmacdonald-stsci Jun 17, 2021 •

edited

Loading

codecov bot commented Jul 9, 2021 •

edited

Loading