dwelltime: add standard errors based on hessian approximation #690

JoepVanlier · 2024-08-19T13:15:09Z

Why this PR?
For some purposes, we need a quantity that calculates an approximate standard error rapidly.
Asymptotics are useful especially when the problem is well constrained (lots of data and a well-suited model).

2-component fit based on 100 dwells. Profile likelihood compared to asymptotic intervals. Confidence intervals would be given by the locations where the curves meet the threshold (dashed horizontal line).

2-component fit based on 1000 dwells. Profile likelihood compared to asymptotic intervals. Confidence intervals would be given by the locations where the curves meet the threshold (dashed horizontal line). Note how for lots of data, these two approaches produce almost identical results.

I have added the ability to plot them alongside the profiles, as it can be instructive to see how these type of errors compare. I think in the future, it would also be useful to explore this on the FdFitting side as well.

Small note on the implementation:

It is important to consider the constraint on the amplitudes when doing this uncertainty analysis. Without it, you get huge confidence intervals for alpha (since the problem is underdertermined). There are two ways of doing this. One you add entries to the Hessian corresponding to the constraint or you transform the derivatives into a subspace that fulfills the constraint, invert there and convert back. I chose the latter, since otherwise, we would have to choose how heavily we set this constraint constant. In the plot below you can see what varying the constraint does. You can basically see it converge to the solution we have now.

Comparing the current approach (dash-dot) with an approach where we take into account the effect of the constraint by adding values to components of the Hessian manually. Note how the approaches agree for large values of C.

The risk with the constant based one is that if you choose the constant too large, it blows up (see figure below), whereas if you choose it too small you don't take the constraint into account sufficiently.

Comparing the current approach (dash-dot) with an approach where we take into account the effect of the constraint by adding values to components of the Hessian manually. Note how excessively large values result in numerical problems.

Considering @rpauszek is working on the dwell time documentation at the moment, I have deferred writing documentation for this specifically at this time.

When calculating the asymptotic uncertainty interval, we should take into account that we actually impose a linear sum constraint (otherwise, the amplitudes will have indeterminate confidence intervals). One could add the constraint explicitly to the Hessian by simply adding a large penalty term to the relevant derivatives. Considering that the sum constraint is of the form (1 - sum(a_i)) ** 2, this would result in adding a constant term d^2f/daidaj = -c with c large to all amplitude terms. What is ugly is that we would need to choose this constant as large as possible without incurring numerical issues. This is why it is preferable to project onto the null space and then calculate the result back instead.

JoepVanlier force-pushed the dwell_hess branch 5 times, most recently from 63822e7 to d001108 Compare August 19, 2024 14:08

JoepVanlier marked this pull request as ready for review September 3, 2024 14:54

JoepVanlier requested review from a team as code owners September 3, 2024 14:54

JoepVanlier requested review from tommasogritti, tobiasjj and rpauszek September 3, 2024 14:54

JoepVanlier force-pushed the dwell_hess branch 2 times, most recently from 8aaa9df to fcc014a Compare September 5, 2024 15:12

JoepVanlier force-pushed the dwell_hess branch from fcc014a to 0d69df5 Compare October 7, 2024 14:16

JoepVanlier force-pushed the dwell_hess branch from 0d69df5 to fdf4541 Compare October 18, 2024 15:08

JoepVanlier added 2 commits October 29, 2024 22:02

profiles: provide the option to plot w/stderr

4776d86

JoepVanlier force-pushed the dwell_hess branch from fdf4541 to 4776d86 Compare October 29, 2024 21:03

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

dwelltime: add standard errors based on hessian approximation #690

dwelltime: add standard errors based on hessian approximation #690

JoepVanlier commented Aug 19, 2024 •

edited

Loading

dwelltime: add standard errors based on hessian approximation #690

Are you sure you want to change the base?

dwelltime: add standard errors based on hessian approximation #690

Conversation

JoepVanlier commented Aug 19, 2024 • edited Loading

JoepVanlier commented Aug 19, 2024 •

edited

Loading