ACTIN-1057: Filter NaN likelihoods from CUPPA when converting in ORANGE #569

kduyvesteyn · 2024-06-21T12:52:00Z

Have tested on data-vm and confirmed this ORANGE version creates a JSON file which can be ingested into ACTIN (no NaN exception anymore).

kzuberihmf · 2024-06-21T14:26:01Z

orange/src/main/java/com/hartwig/hmftools/orange/algo/cuppa/CuppaDataFactory.java

@@ -83,9 +79,13 @@ private static List<CuppaPrediction> extractSortedProbabilities(@NotNull CuppaPr
                    .altSjCohortClassifier(probabilitiesByClassifier.get(ClassifierName.ALT_SJ))
                    .build();

-            cuppaPredictionsOrangeFormat.add(prediction);
+            // If a classifier has no data for a specific cancer type we should remove it completely, see ACTIN-1057


Feels a bit unusual to refer to a ticket in code, unless it's an open TODO that needs follow-up.

Indeed, I'd normally just include this sort of thing in the commit message. Future devs can annotate the code to see it.

Do note that I added a comment because the implementation choice may not be optimal. We could, for instance, also expose a wider CUPPA datamodel in ORANGE and leave it to downstream consumers to decide how to handle NaN likelihoods, but I considered that unnecessary at this point.
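
For readers following the thread, the filtering approach discussed here can be sketched roughly as below. This is a minimal illustration under stated assumptions, not the actual `CuppaDataFactory` code: the `Prediction` class and the `dropNaNLikelihoods` helper are hypothetical names, and the likelihood values are made up.

```java
import java.util.ArrayList;
import java.util.List;

public class NaNLikelihoodFilterSketch {

    // Hypothetical stand-in for a CUPPA prediction; NOT the real ORANGE datamodel.
    static final class Prediction {
        final String cancerType;
        final double likelihood;

        Prediction(String cancerType, double likelihood) {
            this.cancerType = cancerType;
            this.likelihood = likelihood;
        }
    }

    // Hypothetical helper: keeps only predictions whose likelihood is a real number,
    // so that no NaN ends up in the serialized output.
    static List<Prediction> dropNaNLikelihoods(List<Prediction> predictions) {
        List<Prediction> filtered = new ArrayList<>();
        for (Prediction prediction : predictions) {
            if (!Double.isNaN(prediction.likelihood)) {
                filtered.add(prediction);
            }
        }
        return filtered;
    }

    public static void main(String[] args) {
        // Illustrative values only.
        List<Prediction> predictions = new ArrayList<>();
        predictions.add(new Prediction("Skin: Melanoma", 0.9));
        predictions.add(new Prediction("Bone/Soft tissue: Cartilaginous neoplasm", Double.NaN));

        for (Prediction kept : dropNaNLikelihoods(predictions)) {
            System.out.println(kept.cancerType + " -> " + kept.likelihood);
        }
    }
}
```

The alternative mentioned above, exposing a wider datamodel, would instead keep the NaN entries and leave the decision to whatever consumes the ORANGE output.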

kzuberihmf · 2024-06-21T14:34:19Z

orange/src/test/java/com/hartwig/hmftools/orange/algo/cuppa/CuppaDataFactoryTest.java

+        expectedPredictionsByCancerType.put(expectedPredictionMelanoma.cancerType(), expectedPredictionMelanoma);
+        expectedPredictionsByCancerType.put(expectedPredictionSkinOther.cancerType(), expectedPredictionSkinOther);
+        expectedPredictionsByCancerType.put(expectedPredictionProstate.cancerType(), expectedPredictionProstate);
+        expectedPredictionsByCancerType.put("Bone/Soft tissue: Cartilaginous neoplasm", null);


Where is the null set up in the test data? ... Oh OK, I see that it was always there, but it's actually missing values in the input file from CUPPA and not explicit NaNs.

Correct, that's what tricked us in the first place :)
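
To illustrate why a missing value is easy to overlook, here is a small sketch of how a blank field can end up as NaN after parsing. It assumes a hypothetical `parseLikelihood` helper that maps empty fields to `Double.NaN`; the thread does not show how the real CUPPA input is parsed, so treat this purely as an illustration of the pitfall.

```java
public class MissingValueSketch {

    // Hypothetical parser: a blank or absent field is treated as "no data" and
    // becomes NaN, exactly like a field that literally contains "NaN".
    static double parseLikelihood(String rawField) {
        if (rawField == null || rawField.trim().isEmpty()) {
            return Double.NaN;
        }
        return Double.parseDouble(rawField.trim());
    }

    public static void main(String[] args) {
        // Neither input file needs to contain the text "NaN" for the parsed
        // value to be NaN, which is why searching the test data for it finds nothing.
        System.out.println(Double.isNaN(parseLikelihood("")));
        System.out.println(Double.isNaN(parseLikelihood("NaN")));
        System.out.println(Double.isNaN(parseLikelihood("0.25")));
    }
}
```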

ACTIN-1057: Filter NaN likelihoods from CUPPA when converting in ORANGE

a0c98ae

kduyvesteyn requested review from pauldwolfe and kzuberihmf June 21, 2024 12:52

kzuberihmf approved these changes Jun 21, 2024

pauldwolfe approved these changes Jun 21, 2024

ACTIN-1057: Remove reference to ticket from comment

310f1ed

kduyvesteyn merged commit 65fbdf1 into master Jun 22, 2024

kduyvesteyn deleted the ACTIN-1057 branch June 22, 2024 08:11
