
refactoring the image writer modules #3595

Closed
wyli opened this issue Jan 6, 2022 · 9 comments · Fixed by #3773
Labels
Feature request Module: data datasets, readers/writers, synthetic data Module: transform data transforms for preprocessing and postprocessing.

Comments

wyli commented Jan 6, 2022

This is a follow-up on the image writer of the previous I/O module proposals.

Goal and anticipated benefits

The current image writing module has been primarily built around the NIfTI specifications using the nibabel package as the backend.
The goal is to redesign the module in order to decouple the universal and the backend-specific implementations.

This will make it easier to bring in various writer backends such as ITK-Python, thereby supporting more image formats and more powerful customisation of the writer module.

related tickets: #2620 #2613 Project-MONAI/MONAILabel#211
proof of concept #3443

Details

The writer module should have (1) universal logic, (2) backend-specific logic, and (3) a mechanism for defaulting/selecting backends. Specifically:

(1) In the context of image writing, the universal logic handles the image-related outputs from the deep learning workflows, such as:

  • fetching data array and metadata
  • arrangements/resampling of spatial/channel dimensions
  • preparing target data types
  • preparing output folder structures, potentially compatible with BIDS (Brain Imaging Data Structure) and NWB (Neurodata Without Borders)
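The first two universal steps (array/metadata handling and dimension arrangement) can be sketched in a few lines. The function name and defaults below are hypothetical, not MONAI's API; it is only meant to illustrate the kind of backend-agnostic preparation described above:

```python
import numpy as np

def prepare_output_array(data, channel_dim=0, output_dtype=np.float32, squeeze_end_dims=True):
    """Illustrative universal-logic step: move the channel dimension to the end
    (channel-last is what many writer backends expect), optionally squeeze
    trailing singleton dimensions, and cast to the target dtype."""
    data = np.moveaxis(np.asarray(data), channel_dim, -1)
    if squeeze_end_dims:
        while data.ndim > 1 and data.shape[-1] == 1:
            data = data[..., 0]
    return data.astype(output_dtype)

out = prepare_output_array(np.zeros((2, 3, 4, 5)), channel_dim=0)
print(out.shape)  # channel dim moved to the end: (3, 4, 5, 2)
```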

(2) The backend-specific logic includes

  • creating data representations according to the backend APIs
  • calling the backend APIs to finish the data writing

(3) The backend selection logic operates based on the user-specified parameters, backend system availability, and current system default configurations.
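A minimal sketch of such a selection mechanism, assuming a simple extension-to-writer registry. The names `register_writer`/`resolve_writer` anticipate the proof of concept discussed below; the bodies and the `backend_available` hook are illustrative only:

```python
# Hypothetical registry mapping filename extensions to candidate writer classes.
SUPPORTED_WRITERS = {}

def register_writer(ext, *writer_classes):
    """Register candidate writer classes for an extension, in order of preference."""
    SUPPORTED_WRITERS[ext.lower()] = writer_classes

def resolve_writer(ext):
    """Return the first registered writer whose backend is available on this system."""
    for cls in SUPPORTED_WRITERS.get(ext.lower(), ()):
        if getattr(cls, "backend_available", lambda: True)():
            return cls
    raise ValueError(f"no available writer backend for '{ext}'")

class NibabelWriter:  # stand-in; a real implementation would wrap nibabel
    @staticmethod
    def backend_available():
        return True  # in practice: try importing nibabel

register_writer(".nii.gz", NibabelWriter)
print(resolve_writer(".nii.gz").__name__)  # NibabelWriter
```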

@wyli wyli added this to MONAI 0.9 Jan 6, 2022
@wyli wyli added Feature request Module: data datasets, readers/writers, synthetic data Module: transform data transforms for preprocessing and postprocessing. labels Jan 6, 2022

wyli commented Jan 12, 2022

#3443 is now a working POC; it provides the following new features (mainly in monai/data/image_writer.py):

  • An ImageWriter base class with backend-agnostic utilities such as resample_if_needed
  • Subclasses (PILWriter, ITKWriter, NibabelWriter) with backend-specific logic, implementing
    • set_data_array
    • set_metadata
    • write
  • register_writer and resolve_writer, so that a suitable writer is selected based on the filename extension
  • FolderLayout for generating organised filenames
  • Revised existing SaveImage transform to use the new writer module.
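As an aside, the filename-generation role of FolderLayout can be sketched as below. This is a hypothetical stand-in written for illustration; the real monai.data.FolderLayout API may differ in names and parameters:

```python
import os

class SimpleFolderLayout:
    """Illustrative FolderLayout-style helper: builds organised output paths
    from a subject id, an optional postfix, and an optional index."""

    def __init__(self, output_dir, postfix="seg", extension=".nii.gz"):
        self.output_dir = output_dir
        self.postfix = postfix
        self.extension = extension

    def filename(self, subject, idx=None):
        name = subject
        if self.postfix:
            name += f"_{self.postfix}"
        if idx is not None:
            name += f"_{idx}"
        return os.path.join(self.output_dir, name + self.extension)

layout = SimpleFolderLayout("/tmp/out")
print(layout.filename("image01"))  # /tmp/out/image01_seg.nii.gz (POSIX path)
```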

These are non-breaking changes; the primary usage will be:

(
    writer_cls(output_dtype=self.output_dtype, scale=self.scale)
    .set_data_array(img, channel_dim=0, squeeze_end_dims=self.squeeze_end_dims)
    .set_metadata(meta_dict=meta_data, resample=self.resample, **self.resample_dict)
    .write(filename, verbose=self.print_log)
)

and it has basic support for .nrrd and .dcm writing:

import numpy as np
import torch

TEST_CASE_3 = [torch.randint(0, 255, (1, 2, 3, 4)), {"filename_or_obj": "testfile0.nrrd"}, ".nrrd", False]
TEST_CASE_4 = [
    np.random.randint(0, 255, (3, 2, 4, 5), dtype=np.uint8),
    {"filename_or_obj": "testfile0.dcm"},
    ".dcm",
    False,
]
Please let me know any high-level comments, then I'll revise the design and submit smaller PRs to add these components into the core codebase.

After these I believe we can deprecate png_writer, png_saver, nifti_writer, nifti_saver.

cc @ericspod @rijobro @Nic-Ma @Project-MONAI/core-reviewers

@wyli wyli mentioned this issue Jan 12, 2022

Nic-Ma commented Jan 13, 2022

Thanks @wyli for the great work to totally enhance our IO part!
The usage API is interesting:

(
    writer_cls(output_dtype=self.output_dtype, scale=self.scale)
    .set_data_array(img, channel_dim=0, squeeze_end_dims=self.squeeze_end_dims)
    .set_metadata(meta_dict=meta_data, resample=self.resample, **self.resample_dict)
    .write(filename, verbose=self.print_log)
)

I am not sure whether this "Java-style" return self is also common practice in the Python world? @ericspod

Thanks.

wyli commented Jan 13, 2022

thanks, it is possible to have a basic approach with the input arguments in one place:

write_cls.write(data, meta_dict, channel_dim, resample, filename, ...)

but

  • the argument list will be very long
  • it'll be difficult to modify/extend the subroutines within this function

the current approach divides the routine into set_data_array, set_metadata, and write parts. I feel it's flexible enough, but I'm happy to change or experiment with any other ideas...
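A toy sketch of why the staged split helps with extension: a subclass can override a single stage without touching the rest of the pipeline, which a monolithic write(data, meta_dict, ...) call would not allow. All class and method bodies here are illustrative stand-ins, not MONAI code:

```python
class BaseWriter:
    """Minimal three-stage writer: set data, set metadata, then write."""

    def set_data_array(self, data, **kwargs):
        self.data = data

    def set_metadata(self, meta_dict=None, **kwargs):
        self.meta = dict(meta_dict or {})

    def write(self, filename, **kwargs):
        # a real writer would serialise here; return the state for inspection
        return filename, self.data, self.meta

class DefaultingWriter(BaseWriter):
    def set_metadata(self, meta_dict=None, **kwargs):
        # extend just this stage: inject a default affine when none is given
        meta = dict(meta_dict or {})
        meta.setdefault("affine", "identity")
        super().set_metadata(meta, **kwargs)

w = DefaultingWriter()
w.set_data_array([1, 2, 3])
w.set_metadata({})
print(w.write("out.nii.gz")[2])  # {'affine': 'identity'}
```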

wyli commented Jan 14, 2022

may want to support metadata from existing objects

img = nibabel.load('img')
writer.set_metadata(img)

this is not implemented but should be feasible without changing the API
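One way this could work without changing the API is duck typing inside set_metadata: accept either a plain dict or a loaded image object exposing nibabel-like attributes. The helper and the fake image class below are purely illustrative:

```python
def to_meta_dict(meta):
    """Normalise a metadata source into a plain dict.

    Accepts either a dict or an object with a nibabel-like `affine` attribute
    (e.g. the result of nibabel.load); anything else is rejected."""
    if isinstance(meta, dict):
        return meta
    if hasattr(meta, "affine"):  # duck-typed nibabel-like image
        return {"affine": meta.affine, "original_affine": meta.affine}
    raise TypeError(f"unsupported metadata source: {type(meta)}")

class FakeNibabelImage:  # stand-in for a nibabel.load(...) result
    affine = [[1, 0, 0, 0], [0, 1, 0, 0], [0, 0, 1, 0], [0, 0, 0, 1]]

print(to_meta_dict(FakeNibabelImage())["affine"][0])  # [1, 0, 0, 0]
```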

ericspod commented Jan 18, 2022

> Thanks @wyli for the great work to totally enhance our IO part! The usage API is interesting:
>
>     (
>         writer_cls(output_dtype=self.output_dtype, scale=self.scale)
>         .set_data_array(img, channel_dim=0, squeeze_end_dims=self.squeeze_end_dims)
>         .set_metadata(meta_dict=meta_data, resample=self.resample, **self.resample_dict)
>         .write(filename, verbose=self.print_log)
>     )
>
> I am not sure whether this "Java-style" return self is also common practice in the Python world? @ericspod
>
> Thanks.

It's rare in Python; it's more of a C++ thing, I feel. I'm not totally sure whether it's a good pattern or not. A lot of the use cases for this involve making up for a shortcoming in a language that would otherwise force a cumbersome chain of method calls in separate statements. It makes sense when you want to provide an API for a configurable sequence of operations. Other language mechanisms, such as variable arguments or operator overloading, can be used instead (C++'s use of << in streams, for example). PyTorch does use it to chain in-place operations on tensors (add_, sub_) to closely replicate the structure of an equivalent expression using operators; that makes sense, since Python doesn't provide explicit in-place operator expressions.
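For context, here is a minimal sketch of the "return self" fluent pattern under discussion. The Query class is a made-up example, unrelated to MONAI:

```python
class Query:
    """Toy builder: each configuration method returns self so calls can chain."""

    def __init__(self):
        self.parts = []

    def select(self, cols):
        self.parts.append(f"SELECT {cols}")
        return self  # returning self is what enables chaining

    def where(self, cond):
        self.parts.append(f"WHERE {cond}")
        return self

    def build(self):
        return " ".join(self.parts)

print(Query().select("*").where("x > 1").build())  # SELECT * WHERE x > 1
```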

Here, since any use beyond these methods being called in this specific sequence probably isn't going to arise with these writers, we could just have the call sequence as normal and not worry about returning self:

writer_obj = writer_cls(output_dtype=self.output_dtype, scale=self.scale) 
writer_obj.set_data_array(img, channel_dim=0, squeeze_end_dims=self.squeeze_end_dims) 
writer_obj.set_metadata(meta_dict=meta_data, resample=self.resample, **self.resample_dict) 
writer_obj.write(filename, verbose=self.print_log) 

@ericspod

I added a few comments for #3443 but otherwise I think it's in the right direction. I see that code has been moved to #3674 so the comments can be ported over.

wyli commented Jan 18, 2022

thanks, I'll update #3674 #3443 to address these comments.

removed return in c6d8a9f

@deepib deepib moved this to In Progress in MONAI 0.9 Jan 19, 2022
Repository owner moved this from In Progress to Done in MONAI 0.9 Feb 10, 2022

wyli commented Feb 11, 2022

with the latest monai.transforms.SaveImage it's now possible to have:

from monai.data import ITKReader, ITKWriter
from monai.transforms import LoadImage, SaveImage

loader = LoadImage(reader=ITKReader)
data, meta = loader("avg152T1_LR_nifti.nii.gz")
saver = SaveImage(output_ext=".nrrd", writer=ITKWriter)
saver(data, meta)

if you are still interested in converting the format in Project-MONAI/MONAILabel#211 @SachidanandAlle @diazandr3s

@SachidanandAlle

We can start using this from the core API directly; we were waiting for this to be available.
