diagnose matrices not aligned with data #77

rmnldwg · 2024-03-05T08:44:57Z

Because the diagnose matrices are computed separately for each T-stage, they are not aligned with the patient data stored in the model anymore.

A solution could be to store the diagnose matrices in the patient data DataFrame and filter that by T-stage when model.diagnose_matrices[t_stage] is called.

This has benefits for both the Bayesian network implementation and the mixture model. And if I didn't overlook anything, this should be possible without breaking changes.

The text was updated successfully, but these errors were encountered:

The last change caused a dramatic slowdown (factor 500) of the data and diagnose matrix access, because it needed to index them from a `DataFrame`. Now, I implemented a basic caching scheme with a patient data cache version that brought back the original speed. Related: #77

The data and diagnose matrix is now computed on demand in the getter for the `patient_data` property. This makes sure that the user always sees an up to date version of the data encoding & the diagnose probabilities. Also, apparently `del dataframe[column]` is much slower than `dataframe.drop(columns)`. I replaced the former with the latter and now the tests are fast again. Related: #77

Since we now have access to the full diagnose matrix by default, there is no need for the Bayesian network T-stage fix anymore. Related: #77

rmnldwg added feature New feature or request code quality Improvements w.r.t. readability of code & best practices of coding 1.0 release Issues to fix before 1.0 release labels Mar 5, 2024

rmnldwg added this to the version 1.0.0 milestone Mar 5, 2024

rmnldwg self-assigned this Mar 5, 2024

rmnldwg added a commit that referenced this issue Mar 5, 2024

fix: don't use fake T-stage for BN model

29fd453

Since we now have access to the full diagnose matrix by default, there is no need for the Bayesian network T-stage fix anymore. Related: #77

rmnldwg closed this as completed in dcd745f Mar 6, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

diagnose matrices not aligned with data #77

diagnose matrices not aligned with data #77

rmnldwg commented Mar 5, 2024

diagnose matrices not aligned with data #77

diagnose matrices not aligned with data #77

Comments

rmnldwg commented Mar 5, 2024