🐞fix support for non-square images #204

ashwinvaidya17 · 2022-04-07T11:51:30Z

Description

Computes padding based on the image size. Might not be the best approach.
Fixes #136 ("GANomaly breaks when changing image size")

Changes

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
This change requires a documentation update

Checklist

My code follows the pre-commit style and check guidelines of this project.
I have performed a self-review of my code
I have commented my code, particularly in hard-to-understand areas
I have made corresponding changes to the documentation
My changes generate no new warnings
I have added tests that prove my fix is effective or that my feature works
New and existing tests pass locally with my changes

djdameln · 2022-04-08T16:12:49Z

anomalib/models/ganomaly/torch_model.py

@@ -40,13 +42,14 @@ def __init__(
    ):
        super().__init__()

-        assert input_size % 16 == 0, "Input size should be a multiple of 16"
+        assert input_size[0] % 16 == 0 and input_size[1] % 16 == 0, "Input size should be a multiple of 16"


I guess this is no longer needed now that we use padding to make sure that the network can process any image size.

djdameln · 2022-04-08T16:21:12Z

anomalib/models/ganomaly/torch_model.py

+        """
+        # find the largest dimension
+        l_dim = 2 ** math.ceil(math.log(max(*input_size), 2))
+        padding = math.ceil((l_dim - input_size[0]) // 2 + 1), math.ceil((l_dim - input_size[1]) // 2 + 1)


This won't work for odd image sizes, e.g. 221.
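The concern about odd sizes can be checked with plain arithmetic. The snippet below is only a sketch: `split_is_exact` is a hypothetical helper, not code from the PR, and it looks solely at the floor-division half of the quoted expression. The `+ 1` term from the diff is left out because its role depends on model code that is not shown here.

```python
def split_is_exact(size: int, target: int) -> bool:
    """Check whether equal padding on both sides can reach `target` exactly."""
    gap = target - size
    per_side = gap // 2  # floor division drops the half pixel of an odd gap
    return size + 2 * per_side == target


# Even input: the gap of 32 splits into 16 + 16.
print(split_is_exact(224, 256))  # True
# Odd input: the gap of 35 floors to 17 + 17, so one pixel is lost.
print(split_is_exact(221, 256))  # False
```

Equal padding on both sides always adds an even number of pixels, so an odd size can never land on a power of two this way.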

djdameln · 2022-04-11T14:58:04Z

anomalib/models/ganomaly/torch_model.py

        return torch.mean(torch.pow((latent_i - latent_o), 2), dim=1).view(-1)  # convert nx1x1 to n
+
+    def get_padded_tensor(self, batch: Tensor) -> Tensor:


Minor issue, but I don't like this name. Maybe apply_padding or pad_inputs? I also think the method could be static.

ashwinvaidya17: I'll update it in a bit. Static is a good idea. In that case, would it make sense to move it to data/utils/image.py?

djdameln: Yes, I think it would make sense to move it to a shared location so that other modules can access it as well. In that case we should maybe name it pad_nextpow2 or similar.

djdameln

Thanks! Just one minor comment left.

djdameln · 2022-04-12T10:39:41Z

anomalib/data/utils/image.py

+    """Compute required padding from input size and return padded images.
+
+    Finds the largest dimension and computes a square image to pass into the model. In case the image dimension


Maybe mention in the description that the function pads the images so that both sides are a power of 2.
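One way to bring both sides to a power of two, odd sizes included, is to split the gap unevenly between the two sides. The following is a minimal sketch under that assumption. The names `next_pow2` and `nextpow2_pad_amounts` are hypothetical and this is not the code merged in this PR. It follows the docstring's description of a square target based on the largest dimension.

```python
from typing import Tuple


def next_pow2(n: int) -> int:
    """Return the smallest power of two that is >= n (n must be positive)."""
    if n < 1:
        raise ValueError("n must be a positive integer")
    # Integer arithmetic avoids floating-point rounding in log-based versions.
    return 1 << (n - 1).bit_length()


def nextpow2_pad_amounts(height: int, width: int) -> Tuple[int, int, int, int]:
    """Return (left, right, top, bottom) padding for a square power-of-two canvas."""
    side = next_pow2(max(height, width))
    gap_h, gap_w = side - height, side - width
    # For an odd gap, the extra pixel goes to the first side (left / top).
    left, right = gap_w - gap_w // 2, gap_w // 2
    top, bottom = gap_h - gap_h // 2, gap_h // 2
    return left, right, top, bottom


print(nextpow2_pad_amounts(221, 221))  # (18, 17, 18, 17)
print(nextpow2_pad_amounts(224, 300))  # (106, 106, 144, 144)
```

The tuple is ordered the way `torch.nn.functional.pad` expects for the last two dimensions of an image batch, so it could be passed as the `pad` argument there. Which side receives the extra pixel is an arbitrary choice in this sketch.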

🐞fix support for non-square images

27ef852

ashwinvaidya17 added the Bug label (Something isn't working) Apr 7, 2022

ashwinvaidya17 requested review from samet-akcay and djdameln April 7, 2022 11:51

djdameln requested changes Apr 8, 2022

Pad batch

3b903eb

samet-akcay requested a review from djdameln April 11, 2022 14:52

djdameln reviewed Apr 11, 2022

Move pad function to utils

38d1fb9

samet-akcay requested a review from djdameln April 12, 2022 10:06

djdameln requested changes Apr 12, 2022

Update docstring

7e4facf

djdameln approved these changes Apr 12, 2022

samet-akcay merged commit e706f95 into development Apr 12, 2022

samet-akcay deleted the fix/ashwin/ganomaly_image_size branch April 12, 2022 11:15
