Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

JP-3576: Implement the Ramp Fitting Likelihood Algorithm #278

Merged
merged 63 commits into from
Sep 19, 2024
Merged
Show file tree
Hide file tree
Changes from 59 commits
Commits
Show all changes
63 commits
Select commit Hold shift + click to select a range
ed89b83
First pass at adding the maximum likelihood algorithm to ramp fitting.
kmacdonald-stsci Apr 1, 2024
23684aa
Cleaning up test.
kmacdonald-stsci Apr 2, 2024
2d737d5
Updating likelihood code.
kmacdonald-stsci Apr 3, 2024
e548b08
Updating test.
kmacdonald-stsci Apr 3, 2024
f802ea2
Updating comments and computing diffs2use mask for ramp segmentation.
kmacdonald-stsci Apr 8, 2024
0885503
Attempts at computing values for the ERR array.
kmacdonald-stsci Apr 9, 2024
aa2d28a
Adding DQ flag computation for integrations. Adding testing to analy…
kmacdonald-stsci Apr 11, 2024
130e426
Began updating code to include the Poisson and read noise variance co…
kmacdonald-stsci Apr 24, 2024
b0a7da7
Updating the likelihood algorithm to compute the Poisson and read noi…
kmacdonald-stsci Apr 26, 2024
dbe44c7
Update tests and debugging functions.
kmacdonald-stsci May 1, 2024
fe54dab
Adding rate product computations.
kmacdonald-stsci May 7, 2024
6c497fb
Updating likelihood testing.
kmacdonald-stsci May 11, 2024
e2af635
Correcting the computation of alpha_readnoise.
kmacdonald-stsci May 11, 2024
6ee8337
Making some style changes and adding an exception if the ngroups are …
kmacdonald-stsci May 11, 2024
0ca1777
Adding gain to the computatios.
kmacdonald-stsci May 15, 2024
4572bef
Changes during testing of readtime computations.
kmacdonald-stsci May 15, 2024
239349c
Updating testing and debugging functions for testing.
kmacdonald-stsci May 15, 2024
29e404e
Updating likelhood algorithm initial countrate guess and differences …
kmacdonald-stsci Jun 11, 2024
e6710d6
Updating the likelihood tests and removing unnecessary print statements.
kmacdonald-stsci Jun 12, 2024
270627e
Refactoring the code.
kmacdonald-stsci Jun 13, 2024
a624de9
Updating the algorithm to choose the likelihood algorithm based on a …
kmacdonald-stsci Jun 13, 2024
1b2e379
Updating the tests for the likelihood algorithm.
kmacdonald-stsci Jun 13, 2024
886cd30
Removing unneeded comments.
kmacdonald-stsci Jun 13, 2024
b4df659
Updating the description of ramp fitting to include the likelihood al…
kmacdonald-stsci Jun 21, 2024
f28fddf
Some minor editing. Noting where bad groups need to be identified. …
kmacdonald-stsci Jun 21, 2024
11694fc
Updating some basic debugging tests.
kmacdonald-stsci Jun 26, 2024
1e27431
Suppressing warnings.
kmacdonald-stsci Jun 26, 2024
3ef5710
Updating comments.
kmacdonald-stsci Jun 28, 2024
f54eef7
Updating the computation of which diffs to use during the likelihood …
kmacdonald-stsci Jun 28, 2024
97f49e3
Debugging likelihood ramp fitting tests.
kmacdonald-stsci Jun 29, 2024
92fe6b5
Updating the formatting, as well as identifying possible jump detecti…
kmacdonald-stsci Jul 8, 2024
9f530a0
Fixed shape bug when using dark current and changed the name of a fun…
kmacdonald-stsci Aug 7, 2024
d8a041d
Adding jump detection flagging to the likelihood algorithm for ramp f…
kmacdonald-stsci Aug 7, 2024
392a0f6
Adding test for proper flagging for the likelihood jump detection in …
kmacdonald-stsci Aug 7, 2024
ca4b7b2
Removing skip for multi-pixel test.
kmacdonald-stsci Aug 7, 2024
18e58c6
Updating the jump detection test for the ramp fitting using the likel…
kmacdonald-stsci Aug 7, 2024
afb3c65
Removing debugging import from ramp fitting likelihood.
kmacdonald-stsci Aug 7, 2024
473cb6f
Updating the change log and cleaning up the code comments.
kmacdonald-stsci Aug 7, 2024
702d47b
Updating link documents for ramp fitting likelihood algorithm descrip…
kmacdonald-stsci Aug 9, 2024
f67f564
Updating docstring based on code review.
kmacdonald-stsci Aug 9, 2024
f67e8ea
Adding parameter to the RampData class to be used in the likelihood a…
kmacdonald-stsci Aug 12, 2024
453bf43
Updating the likelihood algorithm based on code review.
kmacdonald-stsci Aug 12, 2024
647f560
Updating some comments and the usage of the readnoise array from CRDS.
kmacdonald-stsci Aug 14, 2024
47758d1
Updating test due to change in read noise computation.
kmacdonald-stsci Aug 14, 2024
2163be7
Updating comments.
kmacdonald-stsci Aug 14, 2024
3ca30c1
Making updates based on code review.
kmacdonald-stsci Sep 4, 2024
383d267
Adding change log fragment for towncrier.
kmacdonald-stsci Sep 4, 2024
5276b88
Removing pedestal related code.
kmacdonald-stsci Sep 5, 2024
9f11478
Renaming file.
kmacdonald-stsci Sep 5, 2024
a3f3a89
Changes made due to code review.
kmacdonald-stsci Sep 5, 2024
151f769
Updating due to code review.
kmacdonald-stsci Sep 5, 2024
7b22dcf
Update due to code review.
kmacdonald-stsci Sep 5, 2024
cfc0e69
Updating due to code review.
kmacdonald-stsci Sep 6, 2024
d260730
Testing short ramp that should be switched to OLS_C.
kmacdonald-stsci Sep 10, 2024
cd90b7f
Updating code based on code review.
kmacdonald-stsci Sep 11, 2024
6a2abb0
Updating documentation based on code review.
kmacdonald-stsci Sep 11, 2024
9c7e438
Updating docs for likelihood algorithm in ramp fitting.
kmacdonald-stsci Sep 12, 2024
f3ba0eb
Suppressing warnings in likelihood fit computations.
kmacdonald-stsci Sep 13, 2024
f97d568
Updating the jump handling for the likelihood ramp fitting algorithm.
kmacdonald-stsci Sep 18, 2024
fe59fd6
Updating the calculations collapsing the rateints product to the rate…
kmacdonald-stsci Sep 18, 2024
d900a80
Updating the likelihood algorithm description documentation.
kmacdonald-stsci Sep 18, 2024
7a07043
Changing sum to nansum.
kmacdonald-stsci Sep 19, 2024
375f310
Updating the documentation for ramp fitting wrt the likelihood algori…
kmacdonald-stsci Sep 19, 2024
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
1 change: 1 addition & 0 deletions changes/278.apichange.rst
Original file line number Diff line number Diff line change
@@ -0,0 +1 @@
[ramp_fitting] Add the likelihood algorithm to ramp fitting.
47 changes: 41 additions & 6 deletions docs/stcal/ramp_fitting/description.rst
Original file line number Diff line number Diff line change
Expand Up @@ -2,15 +2,14 @@ Description
===========

This step determines the mean count rate, in units of counts per second, for
each pixel by performing a linear fit to the data in the input file. The fit
is done using the "ordinary least squares" method.
The fit is performed independently for each pixel.
each pixel by performing a linear fit to the data in the input file. The default
method uses "ordinary least squares" based on the Fixsen fitting algorithm
described by
`Fixsen et al. (2011) <https://ui.adsabs.harvard.edu/abs/2000PASP..112.1350F>`_.

The count rate for each pixel is determined by a linear fit to the
cosmic-ray-free and saturation-free ramp intervals for each pixel; hereafter
this interval will be referred to as a "segment." The fitting algorithm uses an
'optimal' weighting scheme, as described by
`Fixsen et al. (2011) <https://ui.adsabs.harvard.edu/abs/2000PASP..112.1350F>`_.
this interval will be referred to as a "segment."

Segments are determined using
the 4-D GROUPDQ array of the input data set, under the assumption that the jump
Expand All @@ -19,6 +18,11 @@ saturation flags are found. Pixels are processed simultaneously in blocks
using the array-based functionality of numpy. The size of the block depends
on the image size and the number of groups.

There is a likelihood algorithm implemented based on Timothy Brandt's papers:
`Brandt (2024) <https://iopscience.iop.org/article/10.1088/1538-3873/ad38d9>`__
and
`Brandt (2024) <https://iopscience.iop.org/article/10.1088/1538-3873/ad38da>`__.

melanieclarke marked this conversation as resolved.
Show resolved Hide resolved
.. _ramp_output_products:

Output Products
Expand Down Expand Up @@ -317,3 +321,34 @@ that pixel will be flagged as JUMP_DET in the corresponding integration in the
"rateints" product. That pixel will also be flagged as JUMP_DET in the "rate"
product.

Likelihood Algorithm Details
----------------------------
As an alternative to the OLS algorithm, a likelihood algorithm can be selected
with the step argument ``--ramp_fitting.algorithm=LIKELY``. This algorithm has
its own algorithm for jump detection. The normal jump detection algorithm runs
by default, but can be skipped when selecting the LIKELY algorithm. This
algorithm removes jump detection flags and sets jump detection flags. This jump
detection algorithm requires a minimum of four (4) NGROUPS. If the LIKELY
algorithm is selected for data with NGROUPS less than four, the ramp fitting
algorithm is changed to OLS_C.

Each pixel is independently processed, but rather than operate on each
group/resultant directly, the likelihood algorithm is based on differences of
the groups/resultants :math:`d_i = r_i - r_{i-1}`. The model used to determine
the slope/countrate, :math:`a`, is:

.. math::
\chi^2 = ({\bf d} - a \cdot {\bf 1})^T C ({\bf d} - a \cdot {\bf 1}) \,,

Differentiating, setting to zero, then solving for :math:`a` results in

.. math::
a = ({\bf 1}^T C {\bf d})({\bf 1}^T C {\bf 1})^T \,,

The covariance matrix :math:`C` is a tridiagonal matrix, due to the nature of the
differences. Because the covariance matrix is tridiagonal, the computational
complexity reduces from :math:`O(n^3)` to :math:`O(n)`. To see the detailed
derivation and computations implemented, refer to the links above.
The Poisson and read noise computations are based on equations (27) and (28),
defining :math:`\alpha_i`, the diagonal of :math:`C`, and :math:`\beta_i`, the
off diagonal.
319 changes: 319 additions & 0 deletions src/stcal/ramp_fitting/likely_algo_classes.py
Original file line number Diff line number Diff line change
@@ -0,0 +1,319 @@
import numpy as np
from scipy import special


class IntegInfo:
"""
Storage for the integration information for ramp fitting computations.
"""
def __init__(self, nints, nrows, ncols):
"""
Initialize output arrays.

Parameters
----------
nints : int
The number of integrations in the data.

nrows : int
The number of rows in the data.

ncols : int
The number of columns in the data.
"""
dims = (nints, nrows, ncols)
self.data = np.zeros(shape=dims, dtype=np.float32)

self.dq = np.zeros(shape=dims, dtype=np.uint32)

self.var_poisson = np.zeros(shape=dims, dtype=np.float32)
self.var_rnoise = np.zeros(shape=dims, dtype=np.float32)

self.err = np.zeros(shape=dims, dtype=np.float32)

def prepare_info(self):
"""
Arrange output arrays as a tuple, which the ramp fit step expects.
"""
return (self.data, self.dq, self.var_poisson, self.var_rnoise, self.err)

def get_results(self, result, integ, row):
"""
Capture the ramp fitting computation.

Parameters
----------
result : RampResult
Holds computed ramp fitting information.

integ : int
The current integration being operated on.

row : int
The current row being operated on.
"""
self.data[integ, row, :] = result.countrate
self.err[integ, row, :] = result.uncert
self.var_poisson[integ, row, :] = result.var_poisson
self.var_rnoise[integ, row, :] = result.var_rnoise


class RampResult:
def __init__(self):
"""
Contains the ramp fitting results.
"""
self.countrate = None
self.chisq = None
self.uncert = None
self.var_poisson = None
self.var_rnoise = None
self.weights = None

self.countrate_two_omit = None
self.chisq_two_omit = None
self.uncert_two_omit = None

self.countrate_one_omit = None
self.jumpval_one_omit = None
self.jumpsig_one_omit = None
self.chisq_one_omit = None
self.uncert_one_omit = None

def __repr__(self):
"""
Return string of information about the class.
"""
ostring = f"countrate = \n{self.countrate}"
ostring += f"\nchisq = \n{self.chisq}"
ostring += f"\nucert = \n{self.uncert}"
'''

Check warning on line 90 in src/stcal/ramp_fitting/likely_algo_classes.py

View check run for this annotation

Codecov / codecov/patch

src/stcal/ramp_fitting/likely_algo_classes.py#L87-L90

Added lines #L87 - L90 were not covered by tests
ostring += f"\nweights = \n{self.weights}"

ostring += f"\ncountrate_two_omit = \n{self.countrate_two_omit}"
ostring += f"\nchisq_two_omit = \n{self.chisq_two_omit}"
ostring += f"\nuncert_two_omit = \n{self.uncert_two_omit}"

ostring += f"\ncountrate_one_omit = \n{self.countrate_one_omit}"
ostring += f"\njumpval_one_omit = \n{self.jumpval_one_omit}"
ostring += f"\njumpsig_one_omit = \n{self.jumpsig_one_omit}"
ostring += f"\nchisq_one_omit = \n{self.chisq_one_omit}"
ostring += f"\nuncert_one_omit = \n{self.uncert_one_omit}"
'''

return ostring

Check warning on line 104 in src/stcal/ramp_fitting/likely_algo_classes.py

View check run for this annotation

Codecov / codecov/patch

src/stcal/ramp_fitting/likely_algo_classes.py#L104

Added line #L104 was not covered by tests

def fill_masked_reads(self, diffs2use):
"""
Mask groups to use for ramp fitting.

Replace countrates, uncertainties, and chi squared values that
kmacdonald-stsci marked this conversation as resolved.
Show resolved Hide resolved
are NaN because resultant differences were doubly omitted.
For these cases, revert to the corresponding values in with
fewer omitted resultant differences to get the correct values
without double-counting omissions.

This function replaces the relevant entries of
self.countrate_two_omit, self.chisq_two_omit,
self.uncert_two_omit, self.countrate_one_omit, and
self.chisq_one_omit in place. It does not return a value.

Parameters
----------
diffs2use : ndarray
A 2D array matching self.countrate_one_omit in shape with zero
for resultant differences that were masked and one for
differences that were not masked.
"""
# replace entries that would be nan (from trying to
# doubly exclude read differences) with the global fits.
omit = diffs2use == 0
ones = np.ones(diffs2use.shape)

self.countrate_one_omit[omit] = (self.countrate * ones)[omit]
self.chisq_one_omit[omit] = (self.chisq * ones)[omit]
self.uncert_one_omit[omit] = (self.uncert * ones)[omit]

omit = diffs2use[1:] == 0

self.countrate_two_omit[omit] = (self.countrate_one_omit[:-1])[omit]
self.chisq_two_omit[omit] = (self.chisq_one_omit[:-1])[omit]
self.uncert_two_omit[omit] = (self.uncert_one_omit[:-1])[omit]

omit = diffs2use[:-1] == 0

self.countrate_two_omit[omit] = (self.countrate_one_omit[1:])[omit]
self.chisq_two_omit[omit] = (self.chisq_one_omit[1:])[omit]
self.uncert_two_omit[omit] = (self.uncert_one_omit[1:])[omit]


class Covar:
"""
class Covar holding read and photon noise components of alpha and
beta and the time intervals between the resultant midpoints
"""
def __init__(self, readtimes):
"""
Compute alpha and beta, the diagonal and off-diagonal elements of
the covariance matrix of the resultant differences, and the time
intervals between the resultant midpoints.

Parameters
----------
readtimes : list
List of values or lists for the times of reads. If a list of
lists, times for reads that are averaged together to produce
a resultant.
"""
# Equations (4) and (11) in paper 1.
mean_t, tau, N, delta_t = self._compute_means_and_taus(readtimes)

self.delta_t = delta_t
self.mean_t = mean_t
self.tau = tau
self.Nreads = N

# Equations (28) and (29) in paper 1.
self._compute_alphas_and_betas(mean_t, tau, N, delta_t)

def _compute_means_and_taus(self, readtimes):
"""
Computes the means and taus of defined in EQNs 4 and 11 in paper 1.

Parameters
----------
readtimes : list
List of values or lists for the times of reads. If a list of
lists, times for reads that are averaged together to produce
a resultant.
"""
mean_t = [] # mean time of the resultant as defined in the paper
tau = [] # variance-weighted mean time of the resultant
N = [] # Number of reads per resultant

for times in readtimes:
mean_t += [np.mean(times)]

if hasattr(times, "__len__"):
# eqn 11
N += [len(times)]
k = np.arange(1, N[-1] + 1)
if False:
tau += [
1
/ N[-1] ** 2
* np.sum((2 * N[-1] + 1 - 2 * k) * np.array(times))
]
# tau += [(np.sum((2*N[-1] + 1 - 2*k)*np.array(times))) / N[-1]**2]
else:
length = N[-1]
tmp0 = (2 * length + 1) - (2 * k)
sm = np.sum(tmp0 * np.array(times))
tmp = sm / length**2
tau.append(tmp)
else:
tau += [times]
N += [1]

Check warning on line 216 in src/stcal/ramp_fitting/likely_algo_classes.py

View check run for this annotation

Codecov / codecov/patch

src/stcal/ramp_fitting/likely_algo_classes.py#L215-L216

Added lines #L215 - L216 were not covered by tests

# readtimes is a list of lists, so mean_t is the list of each
# mean of each list.
mean_t = np.array(mean_t)
tau = np.array(tau)
N = np.array(N)
delta_t = mean_t[1:] - mean_t[:-1]

return mean_t, tau, N, delta_t

def _compute_alphas_and_betas(self, mean_t, tau, N, delta_t):
"""
Computes the means and taus defined in EQNs 28 and 29 in paper 1.

Parameters
----------
mean_t : ndarray
The means of the reads for each group.

tau : ndarray
Intermediate computation.

N : ndarray
The number of reads in each group.

delta_t : ndarray
The group differences of integration ramps.
"""
self.alpha_readnoise = (1 / N[:-1] + 1 / N[1:]) / delta_t**2
self.beta_readnoise = -1 / (N[1:-1] * delta_t[1:] * delta_t[:-1])

self.alpha_phnoise = (tau[:-1] + tau[1:] - 2 * mean_t[:-1]) / delta_t**2
self.beta_phnoise = (mean_t[1:-1] - tau[1:-1]) / (delta_t[1:] * delta_t[:-1])

def calc_bias(self, countrates, sig, cvec, da=1e-7):
"""
Calculate the bias in the best-fit count rate from estimating the covariance matrix.

Section 5 of paper 1.

Arguments:
Parameters
----------
countrates : ndarray
Array of count rates at which the bias is desired.

sig : float
Single read noise.

cvec : ndarray
Weight vector on resultant differences for initial estimation
of count rate for the covariance matrix. Will be renormalized
inside this function.

da : float
Fraction of the count rate plus sig**2 to use for finite difference
estimate of the derivative. Optional parameter. Default 1e-7.

Returns
-------
bias : ndarray
Bias of the best-fit count rate from using cvec plus the observed
resultants to estimate the covariance matrix.
"""
alpha = countrates[np.newaxis, :] * self.alpha_phnoise[:, np.newaxis]
alpha += sig**2 * self.alpha_readnoise[:, np.newaxis]
beta = countrates[np.newaxis, :] * self.beta_phnoise[:, np.newaxis]
beta += sig**2 * self.beta_readnoise[:, np.newaxis]

Check warning on line 284 in src/stcal/ramp_fitting/likely_algo_classes.py

View check run for this annotation

Codecov / codecov/patch

src/stcal/ramp_fitting/likely_algo_classes.py#L281-L284

Added lines #L281 - L284 were not covered by tests

# we only want the weights; it doesn't matter what the count rates are.
n = alpha.shape[0]
z = np.zeros((len(cvec), len(countrates)))
result_low_a = fit_ramps(z, self, sig, countrateguess=countrates)

Check warning on line 289 in src/stcal/ramp_fitting/likely_algo_classes.py

View check run for this annotation

Codecov / codecov/patch

src/stcal/ramp_fitting/likely_algo_classes.py#L287-L289

Added lines #L287 - L289 were not covered by tests

# try to avoid problems with roundoff error
da_incr = da * (countrates[np.newaxis, :] + sig**2)

Check warning on line 292 in src/stcal/ramp_fitting/likely_algo_classes.py

View check run for this annotation

Codecov / codecov/patch

src/stcal/ramp_fitting/likely_algo_classes.py#L292

Added line #L292 was not covered by tests

dalpha = da_incr * self.alpha_phnoise[:, np.newaxis]
dbeta = da_incr * self.beta_phnoise[:, np.newaxis]
result_high_a = fit_ramps(z, self, sig, countrateguess=countrates + da_incr)

Check warning on line 296 in src/stcal/ramp_fitting/likely_algo_classes.py

View check run for this annotation

Codecov / codecov/patch

src/stcal/ramp_fitting/likely_algo_classes.py#L294-L296

Added lines #L294 - L296 were not covered by tests
# finite difference approximation to dw/da

dw_da = (result_high_a.weights - result_low_a.weights) / da_incr

Check warning on line 299 in src/stcal/ramp_fitting/likely_algo_classes.py

View check run for this annotation

Codecov / codecov/patch

src/stcal/ramp_fitting/likely_algo_classes.py#L299

Added line #L299 was not covered by tests

bias = np.zeros(len(countrates))
c = cvec / np.sum(cvec)

Check warning on line 302 in src/stcal/ramp_fitting/likely_algo_classes.py

View check run for this annotation

Codecov / codecov/patch

src/stcal/ramp_fitting/likely_algo_classes.py#L301-L302

Added lines #L301 - L302 were not covered by tests

for i in range(len(countrates)):

Check warning on line 304 in src/stcal/ramp_fitting/likely_algo_classes.py

View check run for this annotation

Codecov / codecov/patch

src/stcal/ramp_fitting/likely_algo_classes.py#L304

Added line #L304 was not covered by tests

C = np.zeros((n, n))
for j in range(n):
C[j, j] = alpha[j, i]
for j in range(n - 1):
C[j + 1, j] = C[j, j + 1] = beta[j, i]

Check warning on line 310 in src/stcal/ramp_fitting/likely_algo_classes.py

View check run for this annotation

Codecov / codecov/patch

src/stcal/ramp_fitting/likely_algo_classes.py#L306-L310

Added lines #L306 - L310 were not covered by tests

bias[i] = np.linalg.multi_dot([c[np.newaxis, :], C, dw_da[:, i]])

Check warning on line 312 in src/stcal/ramp_fitting/likely_algo_classes.py

View check run for this annotation

Codecov / codecov/patch

src/stcal/ramp_fitting/likely_algo_classes.py#L312

Added line #L312 was not covered by tests

sig_a = np.sqrt(

Check warning on line 314 in src/stcal/ramp_fitting/likely_algo_classes.py

View check run for this annotation

Codecov / codecov/patch

src/stcal/ramp_fitting/likely_algo_classes.py#L314

Added line #L314 was not covered by tests
np.linalg.multi_dot([c[np.newaxis, :], C, c[:, np.newaxis]])
)
bias[i] *= 0.5 * (1 + special.erf(countrates[i] / sig_a / 2**0.5))

Check warning on line 317 in src/stcal/ramp_fitting/likely_algo_classes.py

View check run for this annotation

Codecov / codecov/patch

src/stcal/ramp_fitting/likely_algo_classes.py#L317

Added line #L317 was not covered by tests

return bias

Check warning on line 319 in src/stcal/ramp_fitting/likely_algo_classes.py

View check run for this annotation

Codecov / codecov/patch

src/stcal/ramp_fitting/likely_algo_classes.py#L319

Added line #L319 was not covered by tests
Loading
Loading