Skip to content

Commit

Permalink
feat: dicom-finder cli tool, and improve documentation (#142)
Browse files Browse the repository at this point in the history
* docs: add mkdocs-include-markdown plugin for Markdown file inclusion

* docs: add tag helpers documentation for DICOM utilities

* docs: add documentation for finding DICOM files

* Update src/imgtools/cli/dicomfind.py

* feat: add regex filtering and improved logging for dicom_finder function

* chore: exclude CLI files from coverage reporting
  • Loading branch information
jjjermiah authored Nov 22, 2024
1 parent 359cc09 commit 2981baa
Show file tree
Hide file tree
Showing 13 changed files with 380 additions and 1,061 deletions.
5 changes: 3 additions & 2 deletions README.md
Original file line number Diff line number Diff line change
@@ -1,4 +1,5 @@
# Med-Imagetools: Transparent and Reproducible Medical Image Processing Pipelines in Python
<!--intro-start-->

[![CI/CD Status](https://github.com/bhklab/med-imagetools/actions/workflows/main.yml/badge.svg)](https://github.com/bhklab/med-imagetools/actions/workflows/main.yml)
![GitHub repo size](https://img.shields.io/github/repo-size/bhklab/med-imagetools)
Expand All @@ -14,7 +15,7 @@
[![PyPI - Format](https://img.shields.io/pypi/format/med-imagetools)](https://pypi.org/project/med-imagetools/)
[![Downloads](https://static.pepy.tech/badge/med-imagetools)](https://pepy.tech/project/med-imagetools)

#### Med-ImageTools core features
## Med-ImageTools core features

* AutoPipeline CLI
* `nnunet` nnU-Net compatibility mode
Expand Down Expand Up @@ -64,7 +65,7 @@ pip install -e git+https://github.com/bhklab/med-imagetools.git
```

This will install the package in editable mode, so that the installed package will update when the code is changed.

<!--intro-end-->
## Latest Updates Nov 21st, 2024

### New CLI entry point `imgtools`
Expand Down
1 change: 1 addition & 0 deletions config/coverage.toml
Original file line number Diff line number Diff line change
@@ -1,6 +1,7 @@
[tool.coverage.run]
omit = [
"src/imgtools/logging/**/*.py",
"src/imgtools/cli/**/*.py",
]

[tool.coverage.report]
3 changes: 2 additions & 1 deletion config/mypy.ini
Original file line number Diff line number Diff line change
@@ -1,6 +1,7 @@
[mypy]
files =
src/imgtools/logging/**/*.py,
src/imgtools/dicom/**/*.py
src/imgtools/dicom/**/*.py,
src/imgtools/cli/**/*.py
exclude =
tests/
106 changes: 1 addition & 105 deletions docs/index.md
Original file line number Diff line number Diff line change
@@ -1,107 +1,3 @@
# Med-Imagetools: Transparent and Reproducible Medical Image Processing Pipelines in Python

[![main-ci](https://github.com/bhklab/med-imagetools/actions/workflows/main-ci.yml/badge.svg)](https://github.com/bhklab/med-imagetools/actions/workflows/main-ci.yml)
![GitHub repo size](https://img.shields.io/github/repo-size/bhklab/med-imagetools)
![GitHub contributors](https://img.shields.io/github/contributors/bhklab/med-imagetools)
![GitHub stars](https://img.shields.io/github/stars/bhklab/med-imagetools?style=social)
![GitHub forks](https://img.shields.io/github/forks/bhklab/med-imagetools?style=social)

## Latest Updates (v0.4.4) - July 27th, 2022

New features include:

* AutoPipeline CLI
* `nnunet` nnU-Net compatibility mode
* Built-in train/test split for both normal/nnU-Net modes
* `random_state` for reproducible seeds
* Region of interest (ROI) yaml dictionary intake for RTSTRUCT processing
* Markdown report output post-processing
* `continue_processing` flag to continue autopipeline
* `dry_run` flag to only crawl the dataset

Med-Imagetools, a python package offers the perfect tool to transform messy medical dataset folders to deep learning ready format in few lines of code. It not only processes DICOMs consisting of different modalities (like CT, PET, RTDOSE and RTSTRUCTS), it also transforms them into deep learning ready subject based format taking the dependencies of these modalities into consideration.

## Introduction

A medical dataset, typically contains multiple different types of scans for a single patient in a single study. As seen in the figure below, the different scans containing DICOM of different modalities are interdependent on each other. For making effective machine learning models, one ought to take different modalities into account.

<img src="images/graph.png" align="center" width="480" ><figcaption>Fig.1 - Different network topology for different studies of different patients</figcaption></a>

Med-Imagetools is a unique tool, which focuses on subject based Machine learning. It crawls the dataset and makes a network by connecting different modalities present in the dataset. Based on the user defined modalities, med-imagetools, queries the graph and process the queried raw DICOMS. The processed DICOMS are saved as nrrds, which med-imagetools converts to torchio subject dataset and eventually torch dataloader for ML pipeline.

<img src="images/autopipeline.png" align="center" width="500"><figcaption>Fig.2 - Med-Imagetools AutoPipeline diagram</figcaption></a>

## Installing med-imagetools

```sh
pip install med-imagetools
```

### (recommended) Create new conda virtual environment

``` sh
conda create -n mit
conda activate mit
pip install med-imagetools
```

### (optional) Install in development mode

``` sh
conda create -n mit
conda activate mit
pip install -e git+https://github.com/bhklab/med-imagetools.git
```

This will install the package in editable mode, so that the installed package will update when the code is changed.

## Getting Started

Med-Imagetools takes two step approch to turn messy medical raw dataset to ML ready dataset.

1. ***Autopipeline***: Crawls the raw dataset, forms a network and performs graph query, based on the user defined modalities. The relevant DICOMS, get processed and saved as nrrds

``` sh
autopipeline\
[INPUT_DIRECTORY] \
[OUTPUT_DIRECTORY] \
--modalities [str: CT,RTSTRUCT,PT] \
--spacing [Tuple: (int,int,int)]\
--n_jobs [int]\
--visualize [flag]\
--nnunet [flag]\
--train_size [float]\
--random_state [int]\
--roi_yaml_path [str]\
--continue_processing [flag]\
--dry_run [flag]
```

2. ***class Dataset***: This class converts processed nrrds to torchio subjects, which can be easily converted to torch dataset

```py
from imgtools.io import Dataset
subjects = Dataset.load_from_nrrd(output_directory, ignore_multi=True)
data_set = tio.SubjectsDataset(subjects)
data_loader = torch.utils.data.DataLoader(data_set, batch_size=4, shuffle=True, num_workers=4)
```


### Contributors

Thanks to the following people who have contributed to this project:

* [@mkazmier](https://github.com/mkazmier)
* [@skim2257](https://github.com/skim2257)
* [@fishingguy456](https://github.com/fishingguy456)
* [@Vishwesh4](https://github.com/Vishwesh4)
* [@mnakano](https://github.com/mnakano)

## Contact

If you have any questions/concerns, you can reach the main development team at sejin.kim@uhnresearch.ca or open an issue on our [GitHub repository](https://github.com/bhklab/med-imagetools)

## License

This project uses the following license: [Apache License 2.0](http://www.apache.org/licenses/)
{% include-markdown "../README.md" start="<!--intro-start-->" end="<!--intro-end-->" %}
3 changes: 3 additions & 0 deletions docs/reference/dicom-utils/find-dicoms.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
# Find DICOMs

::: imgtools.dicom.utils.find_dicoms
7 changes: 7 additions & 0 deletions docs/reference/dicom-utils/tag-helpers.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,7 @@
# Tag Helpers

::: imgtools.dicom.utils.lookup_tag

::: imgtools.dicom.utils.tag_exists

::: imgtools.dicom.utils.similar_tags
1 change: 1 addition & 0 deletions mkdocs.yml
Original file line number Diff line number Diff line change
Expand Up @@ -51,6 +51,7 @@ plugins:
- awesome-pages # simplifies configuring page titles and their order
- search # necessary for search functionality to work
- git-authors # adds authors to pages using git history
- include-markdown # allows for including Markdown files into another Markdown file
- mkdocstrings:
handlers:
python:
Expand Down
Loading

0 comments on commit 2981baa

Please sign in to comment.