Mph norm workflow #125

ngreenwald · 2022-06-07T22:54:05Z

If you haven't already, please read through our contributing guidelines before opening your PR

What is the purpose of this PR?

Adds functionality to the set_up_toffy notebook which allows the user to construct a tuning curve for their instrument. This takes in a detector sweep, and produces a tuning curve relating MPH to signal intensity
Creates the 4b_normalize_image_data notebook which walks the user through the normalization process. This assumes the user has already constructed a tuning curve, and will allow them to easily normalize their data
Switches the logic for normalization. Previously, we tried to fit a relationship between mass and MPH within an FOV, which would then allow us to use a small number of MPH values from selected masses to normalize. However, this created more problems than it solved. Instead, we now compute the MPH for every mass in the panel. This is more compute intensive, but it is quite fast and won't be a major bottleneck. We then fit a separate curve for each mass over the course of the run, modeling how the MPH is changing for that mass as a function of run length. This results in very smooth estimates of MPH for each marker in the panel. We then use those adjusted mass-specific MPH values to normalize the data.
Adds plotting and QC outputs to allow the user to visualize the curve fit, as well as get notified for any normalization values which are outside a proscribed range
Updates the 4a_rosetta notebook to have the same default paths as the rest of the notebooks. Also removes the nested for loop structure in favor of processing a single run, simplifying the code

Closes #37

…into mph_norm_workflow

* combine metrics doesn't require bins * helper function for formatting df * refactor saving function * fov helper function * refactored top level function * pycodestyle * use median for small datasets * update notebook * close plots * fix bug in df construction * sort df by channels * outlier identification * plot outliers separately * remove outlier functionality * change outlier detection * updated docstrings * make test more robust

review-notebook-app · 2022-06-07T22:54:09Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

ackagel

Looks good so far. It looks like a lot of my review comments in review-nb are in the works in normalize.py

toffy/normalize.py

ackagel · 2022-06-09T18:20:25Z

templates/1_set_up_toffy.ipynb

@@ -15,9 +15,10 @@
   "id": "e36293c5-aa89-4029-a3fa-e8ea841bb8b5",


Can't the sensitivity curve generation go in 4b, since it's only used there? Is the idea that putting this here will encourage people to run a sweep before data aq?

Reply via ReviewNB

No, it's because it only needs to happen once per instrument, not separately for each run. So having it here means it won't be present in the notebook each time people are normalizing

ackagel · 2022-06-09T18:20:25Z

templates/1_set_up_toffy.ipynb

@@ -15,9 +15,10 @@
   "id": "e36293c5-aa89-4029-a3fa-e8ea841bb8b5",


Can't this be done programmatically here?

Reply via ReviewNB

Sweeps aren't created in their own folder, they're just put into the main /Data folder. So you need to separately identify each FOV from the sweep, which are given the generic names. This was initially what that find_detector_sweeps function was for, but then Erin had a couple sweeps where FOVs were missing, so it gave an error.

We could change it so that it would give a warning when an FOV is missing, rather than an error, and then ask people to list the first FOV and last FOV of their sweep and it would find the rest, but at that point it started to feel like the solution was almost as complicated as the problem. Up to you though, it would be an easy change

templates/1_set_up_toffy.ipynb

templates/4b_normalize_image_data.ipynb

alex-l-kong

A few aesthetic comments mostly in the notebook.

toffy/normalize.py

templates/1_set_up_toffy.ipynb

templates/4b_normalize_image_data.ipynb

ngreenwald added 20 commits January 31, 2022 15:01

framework for channel counts function

972f9b3

testing for first function

85e6305

notebook

3d89a9e

Merge branch 'main' into mph_norm_workflow

cdd74d4

updated naming

72d3ec3

first part of full workflow

4080bd1

Merge branch 'main' into mph_norm_workflow

12e876c

move to FOV-based normalization scheme

75c746c

update workflow

ea1160b

fix tests

4ac8b66

add logging

6592a8a

standardize notebook naming

9f8726a

check for missing normalization function

2c7cb0d

pycodestyle

1c387ce

Update README.md

45bb0e3

style

2e145fe

Merge branch 'mph_norm_workflow' of https://github.com/angelolab/toffy …

beb8d3e

…into mph_norm_workflow

merge conflict

314514b

fixed merges

3847322

ngreenwald requested review from srivarra, alex-l-kong, ackagel and camisowers June 7, 2022 22:54

ackagel reviewed Jun 9, 2022

View reviewed changes

toffy/normalize.py Outdated Show resolved Hide resolved

toffy/normalize.py Outdated Show resolved Hide resolved

toffy/normalize.py Show resolved Hide resolved

ackagel reviewed Jun 9, 2022

View reviewed changes

alex-l-kong reviewed Jun 9, 2022

View reviewed changes

toffy/normalize.py Outdated Show resolved Hide resolved

alex-l-kong reviewed Jun 9, 2022

View reviewed changes

templates/1_set_up_toffy.ipynb Show resolved Hide resolved

templates/4b_normalize_image_data.ipynb Show resolved Hide resolved

code review comments

19fcd50

add more explanation to notebook on normalization

9eec96c

ngreenwald requested review from alex-l-kong and ackagel June 10, 2022 20:49

ngreenwald added 5 commits June 11, 2022 12:49

Merge branch 'main' into mph_norm_workflow

20cc284

typo in environment.yaml

550f630

switch default to 2nd degree polynomial

d1c397f

merge

c0c4781

update default value and testing

f0f32e0

ngreenwald merged commit e15f1d1 into main Jun 15, 2022

ngreenwald deleted the mph_norm_workflow branch June 15, 2022 23:36

camisowers added the enhancement New feature or request label Sep 28, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Mph norm workflow #125

Mph norm workflow #125

ngreenwald commented Jun 7, 2022 •

edited

Loading

review-notebook-app bot commented Jun 7, 2022

ackagel left a comment

ackagel Jun 9, 2022

ngreenwald Jun 10, 2022

ackagel Jun 9, 2022

ngreenwald Jun 10, 2022

alex-l-kong left a comment

		@@ -15,9 +15,10 @@
		"id": "e36293c5-aa89-4029-a3fa-e8ea841bb8b5",

Mph norm workflow #125

Mph norm workflow #125

Conversation

ngreenwald commented Jun 7, 2022 • edited Loading

review-notebook-app bot commented Jun 7, 2022

ackagel left a comment

Choose a reason for hiding this comment

ackagel Jun 9, 2022

Choose a reason for hiding this comment

ngreenwald Jun 10, 2022

Choose a reason for hiding this comment

ackagel Jun 9, 2022

Choose a reason for hiding this comment

ngreenwald Jun 10, 2022

Choose a reason for hiding this comment

alex-l-kong left a comment

Choose a reason for hiding this comment

ngreenwald commented Jun 7, 2022 •

edited

Loading