✨ Auto-populate Sample, Container on Biospecimen create/update #645

znatty22 · 2024-01-23T19:02:03Z

Motivation

#643 introduced the Sample and Container tables in order to address the shortcomings of the Biospecimen table. Now we need a way to populate these tables. And since a sample and container may be derived from a biospecimen, we can auto-populate them.

Approach

Each time a biospecimen is created or updated via an HTTP POST/PATCH, derive the sample and container from the input biospecimen and update sample/container tables.

Sample, Container Management

Find Sample - check if a sample already exists for this biospecimen
- Use a specific set of biospecimen attributes to uniquely identify the sample:
- sample_event_key = concat(participant_id, external_sample_id, age_at_event_days)
- analyte_type
- composition
- source_text_tissue_type
- source_text_anatomic_site
- preservation_method
- method_of_sample_procurement
- concentration_mg_per_ml
Create Sample - if the sample does not exist - create it using the relevant subset of biospecimen attributes
- All parameters above, plus:
- participant_id
- external_sample_id
- volume_ul
Update Sample - if the sample exists - update it using the relevant subset of biospecimen attributes
Find Container - check if a container already exists for this biospecimen
- Use a specific set of biospecimen attributes to uniquely identify the container:
- biospecimen_id
Create Container - if the container does not exist - create it using the relevant subset of biospecimen attributes
- All parameters above, plus:
- sample_id
- specimen_status
- volume_ul
Update Container - if the container exists - update it using the relevant subset of biospecimen attributes
Sum Volume - update the the sample's volume_ul field with the sum of it's container volumes

calkinsh · 2024-01-24T15:04:05Z

dataservice/api/biospecimen/manager.py

+    params = _get_sample_identifier(biospecimen)
+    # Add remaining sample attributes
+    params.update(
+        {


everything in this PR makes sense to me except the idea that we might update a participant ID. I think participant ID should be part of the defining characteristics of a sample so I'm struggling to understand how we could both identify an existing sample (which implies the participant ID on the sample matches that on the specimen being registered) but then update the sample participant ID field (which implies the participant ID does not match the specimen being registered).

Oh hold on... is this related to participant.kf_id really being the primary ID for particpant and participant_id being a sort of secondary/external ID? So we are updating the external ID if it changes but relying on the kf_id/PK for confirming the sample already exists?

@calkinsh Yep, the primary key for participant is participant.kf_id and the sample has a foreign key to it sample.participant_id so I think that does make it a defining characteristic of the sample.

Technically we may not need the Sample.participant_id or Sample.external_id bc they are captured in the Sample.sample_event_key but I included them in the Sample table in case we want to populate the sample event key with something else and bc I felt it would be ok to have some redundancy to gain some clarity on which participant the sample came from and what the original biospecimen's external sample ID was

znatty22 · 2024-01-31T16:49:41Z

dataservice/api/biospecimen/manager.py

+    return container
+
+
+def _upsert_sample(biospecimen):


Need to change this approach. Read, modify, write is an anti-pattern and doesn't work with concurrent requests. Use postgresql internal upsert (update on conflict)

znatty22 · 2024-02-01T18:53:57Z

Closing for now. New approach is to implement the Sample table only. This is an MVP to meet Portal Beta requirements. Will try to autopopulate the Sample table from Biospecimens similar to approach here

znatty22 added the feature New functionality label Jan 23, 2024

znatty22 self-assigned this Jan 23, 2024

znatty22 force-pushed the populate-sample-container branch 2 times, most recently from 9200f36 to a3f4ca4 Compare January 23, 2024 19:22

znatty22 added 2 commits January 23, 2024 14:24

✨ Add Biospecimen manager to create/update samp/ctn in API

c390603

♻️ Create/update sample, container on biosp create/update

f342275

znatty22 force-pushed the populate-sample-container branch from a3f4ca4 to f342275 Compare January 23, 2024 19:25

znatty22 changed the title ~~✨ Auto-populate Sample and Container on Biospecimen create/update~~ ✨ Auto-populate Sample, Container on Biospecimen create/update Jan 23, 2024

♻️ Only use concentration if analyte_type is DNA, RNA

e3de58f

znatty22 mentioned this pull request Jan 23, 2024

✨ Add Sample and Container #643

Closed

2 tasks

znatty22 marked this pull request as ready for review January 23, 2024 21:21

znatty22 requested a review from a team as a code owner January 23, 2024 21:21

znatty22 marked this pull request as draft January 23, 2024 21:21

znatty22 requested a review from calkinsh January 23, 2024 21:22

calkinsh reviewed Jan 24, 2024

View reviewed changes

calkinsh approved these changes Jan 24, 2024

View reviewed changes

znatty22 added 3 commits January 24, 2024 13:44

🐛 Check for orphaned samples on ctner delete and update

08c07da

🐛 Return updated sample from bs smp cnter manager

96bf11a

✅ Test sample/container management on bs create/update

0553e37

znatty22 force-pushed the populate-sample-container branch from dfa88d7 to 1074988 Compare January 25, 2024 19:49

✅ Fix broken tests - use sample/container mgr to create them

b9aa356

znatty22 force-pushed the populate-sample-container branch from 1074988 to b9aa356 Compare January 25, 2024 21:21

znatty22 commented Jan 31, 2024

View reviewed changes

znatty22 closed this Feb 1, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

✨ Auto-populate Sample, Container on Biospecimen create/update #645

✨ Auto-populate Sample, Container on Biospecimen create/update #645

znatty22 commented Jan 23, 2024 •

edited

Loading

calkinsh Jan 24, 2024

calkinsh Jan 24, 2024

znatty22 Jan 24, 2024

znatty22 Jan 24, 2024 •

edited

Loading

znatty22 Jan 31, 2024

znatty22 commented Feb 1, 2024

✨ Auto-populate Sample, Container on Biospecimen create/update #645

✨ Auto-populate Sample, Container on Biospecimen create/update #645

Conversation

znatty22 commented Jan 23, 2024 • edited Loading

Motivation

Approach

Sample, Container Management

calkinsh Jan 24, 2024

Choose a reason for hiding this comment

calkinsh Jan 24, 2024

Choose a reason for hiding this comment

znatty22 Jan 24, 2024

Choose a reason for hiding this comment

znatty22 Jan 24, 2024 • edited Loading

Choose a reason for hiding this comment

znatty22 Jan 31, 2024

Choose a reason for hiding this comment

znatty22 commented Feb 1, 2024

znatty22 commented Jan 23, 2024 •

edited

Loading

znatty22 Jan 24, 2024 •

edited

Loading