Support instance masks (N,H,W) #2856

ashnair1 · 2024-03-25T09:14:28Z

Changes

Fixes #2855; related to #941.

Type of change

🧪 Test cases
🐞 Bug fix (non-breaking change which fixes an issue)

Checklist

My code follows the style guidelines of this project
I have performed a self-review of my own code
I have commented my code, particularly in hard-to-understand areas
I have made corresponding changes to the documentation
My changes generate no new warnings
Did you update CHANGELOG in case of a major change?

tests/augmentation/test_container.py

kornia/augmentation/container/ops.py

shijianjian

Meanwhile, for the boxes we merge them all into one padded tensor, as done here:

kornia/kornia/geometry/boxes.py

Lines 20 to 33 in 46a5e40

    
    def _merge_box_list(boxes: list[torch.Tensor], method: str = "pad") -> tuple[torch.Tensor, list[int]]:
        r"""Merge a list of boxes into one tensor."""
        if not all(box.shape[-2:] == torch.Size([4, 2]) and box.dim() == 3 for box in boxes):
            raise TypeError(f"Input boxes must be a list of (N, 4, 2) shaped. Got: {[box.shape for box in boxes]}.")

        if method == "pad":
            max_N = max(box.shape[0] for box in boxes)
            stats = [max_N - box.shape[0] for box in boxes]
            output = torch.nn.utils.rnn.pad_sequence(boxes, batch_first=True)
        else:
            raise NotImplementedError(f"`{method}` is not implemented.")

        return output, stats

I'm not sure whether we should do the same for the masks. Can anyone benchmark which approach runs faster: iteration or batching?

kornia/augmentation/container/ops.py

ashnair1 · 2024-03-29T19:34:44Z

Merging a list of tensors into a single tensor will not work for instance segmentation. Each image has a different number of objects, and therefore a different number of masks, so the masks cannot be batched directly.

This is what it would look like:
Single sample: Image (C, H, W) and Mask (N, H, W)
Batched: Image (B, C, H, W) and Mask ([(N₁, H, W), (N₂, H, W) ... (N_b, H, W)])

N (no. of detections) varies from image to image, so they cannot be batched.
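The shape mismatch described above can be illustrated with a small sketch. The tensor sizes and variable names here are made up for illustration and are not taken from the PR:

```python
import torch

# Hypothetical per-image instance masks: 2 objects in the first image, 3 in the second.
H, W = 4, 5
masks = [torch.zeros(2, H, W), torch.ones(3, H, W)]

# Images share a shape, so they stack into (B, C, H, W) without trouble.
images = torch.stack([torch.rand(3, H, W) for _ in masks])

# The masks do not: torch.stack requires every tensor to have the same shape.
try:
    torch.stack(masks)
    stackable = True
except RuntimeError:
    stackable = False

# A plain list keeps one (N_i, H, W) tensor per image instead.
mask_shapes = [tuple(m.shape) for m in masks]
```

Here `stackable` ends up `False`, which is why the batched form carries the masks as a list rather than as a single tensor.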

Refer to #2417 for a proper example of this use case.

shijianjian · 2024-03-29T19:47:02Z

Single sample: Image (C, H, W) and Mask (N, H, W)
Batched: Image (B, C, H, W) and Mask ([(N1, H, W), (N2, H, W) ... (Nb, H, W)])

I meant this:

N_max = max(N1, ..., Nb)
padding = [N_max - N1, ..., N_max - Nb]
padded_masks = (B, N_max, H, W)

... # After augmentation

Unpad (B, N_max, H, W) => Mask ([(N1, H, W), (N2, H, W) ... (Nb, H, W)])
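A minimal sketch of the pad / augment / unpad idea outlined above, modelled on how `_merge_box_list` uses `pad_sequence`. The helper names `pad_masks` and `unpad_masks` are hypothetical and not part of kornia, and a horizontal flip stands in for the actual augmentation:

```python
import torch
from torch.nn.utils.rnn import pad_sequence


def pad_masks(masks: list[torch.Tensor]) -> tuple[torch.Tensor, list[int]]:
    """Pad a list of (N_i, H, W) masks into one (B, N_max, H, W) tensor.

    Hypothetical helper, not part of kornia. Returns the padded tensor and the
    number of padding entries added per sample.
    """
    n_max = max(m.shape[0] for m in masks)
    padding = [n_max - m.shape[0] for m in masks]
    # pad_sequence zero-pads along the first dimension; H and W must match.
    padded = pad_sequence(masks, batch_first=True)
    return padded, padding


def unpad_masks(padded: torch.Tensor, padding: list[int]) -> list[torch.Tensor]:
    """Drop the padding entries and return a list of (N_i, H, W) masks."""
    n_max = padded.shape[1]
    return [padded[i, : n_max - p] for i, p in enumerate(padding)]


masks = [torch.rand(2, 4, 5), torch.rand(3, 4, 5), torch.rand(1, 4, 5)]
padded, padding = pad_masks(masks)         # padded: (3, 3, 4, 5), padding: [1, 0, 2]
augmented = torch.flip(padded, dims=[-1])  # stand-in for a batched augmentation
restored = unpad_masks(augmented, padding)
```

The padded entries are all-zero masks that get transformed along with the real ones and then discarded, so the extra work grows with the spread between the smallest and largest N in the batch.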

I think the current change in this PR is fine. We should benchmark the iterative and batching strategies to see how they perform. Maybe we could add a Mask class, similar to Boxes, to handle this.
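One way such a benchmark could be set up is sketched below. This is only an assumption about what a comparison might look like: it times a horizontal flip on CPU as a stand-in for a real augmentation, the function names are hypothetical, and no result is implied for either strategy.

```python
import time

import torch
from torch.nn.utils.rnn import pad_sequence


def augment(x: torch.Tensor) -> torch.Tensor:
    # Stand-in for a real augmentation; works on (N, H, W) and (B, N, H, W) alike.
    return torch.flip(x, dims=[-1])


def iterative(masks: list[torch.Tensor]) -> list[torch.Tensor]:
    # Strategy 1: apply the augmentation to each sample's masks in turn.
    return [augment(m) for m in masks]


def batched(masks: list[torch.Tensor]) -> list[torch.Tensor]:
    # Strategy 2: pad to (B, N_max, H, W), augment once, then slice the padding off.
    padded = pad_sequence(masks, batch_first=True)
    out = augment(padded)
    return [out[i, : m.shape[0]] for i, m in enumerate(masks)]


def mean_seconds(fn, masks: list[torch.Tensor], repeats: int = 20) -> float:
    # Average wall-clock time per call over a few repeats.
    start = time.perf_counter()
    for _ in range(repeats):
        fn(masks)
    return (time.perf_counter() - start) / repeats


torch.manual_seed(0)
masks = [torch.rand(n, 64, 64) for n in (1, 5, 3, 8)]
t_iter = mean_seconds(iterative, masks)
t_batch = mean_seconds(batched, masks)
print(f"iterative: {t_iter:.6f}s  batched: {t_batch:.6f}s")
```

Timings will depend on batch size, the spread of N across samples, the augmentation itself, and the device. On a GPU the timed region would also need `torch.cuda.synchronize()` calls to give meaningful numbers.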

* Fix for shape error in transform_masks
* Iterate over batch_prob in transform_list
* Run ruff
* Revert prev changes
* Update test case

ashnair1 marked this pull request as draft March 25, 2024 09:15

ashnair1 marked this pull request as ready for review March 25, 2024 11:23

johnnv1 added the "feature request" and "module: augmentations" labels Mar 25, 2024

johnnv1 requested a review from shijianjian March 25, 2024 23:57

johnnv1 reviewed Mar 25, 2024

tests/augmentation/test_container.py (outdated, resolved)

edgarriba approved these changes Mar 26, 2024

kornia/augmentation/container/ops.py (resolved)

ashnair1 added 5 commits March 26, 2024 12:17

Fix for shape error in transform_masks

3a4a804

Iterate over batch_prob in transform_list

5d2d6b3

Run ruff

a9ac39a

Revert prev changes

7eae57f

Update test case

4ebe26d

ashnair1 force-pushed the msk-dim-fix branch from 87571f3 to 4ebe26d Compare March 26, 2024 08:17

shijianjian requested changes Mar 29, 2024

kornia/augmentation/container/ops.py (resolved)

ashnair1 requested a review from shijianjian March 29, 2024 19:39

shijianjian approved these changes Mar 29, 2024

johnnv1 approved these changes Mar 29, 2024

shijianjian merged commit 2c761ee into kornia:main Apr 2, 2024
27 checks passed

ashnair1 deleted the msk-dim-fix branch April 2, 2024 13:49

ashnair1 mentioned this pull request Apr 3, 2024

Remove AugPipe microsoft/torchgeo#1978

Open

cjpurackal pushed a commit to cjpurackal/kornia that referenced this pull request May 18, 2024

Support instance masks (N,H,W) (kornia#2856)

8267c62

ashnair1 mentioned this pull request Jun 29, 2024

Bump kornia min version microsoft/torchgeo#2144

Merged
