Update docstring for RandAugment #4457

VinhLoiIT · 2021-09-22T09:14:22Z

Fixes #4456

datumbox

Thanks for the PR @VinhLoiIT.

I left a couple of comments, please let me know what you think.

torchvision/transforms/autoaugment.py

datumbox · 2021-09-22T09:23:42Z

torchvision/transforms/autoaugment.py

@@ -333,7 +333,7 @@ class TrivialAugmentWide(torch.nn.Module):
    r"""Dataset-independent data-augmentation with TrivialAugment Wide, as described in
    `"TrivialAugment: Tuning-free Yet State-of-the-Art Data Augmentation" <https://arxiv.org/abs/2103.10158>`.
    If the image is torch Tensor, it should be of type torch.uint8, and it is expected
-    to have [..., 1 or 3, H, W] shape, where ... means an arbitrary number of leading dimensions.
+    to have [..., 3, H, W] shape, where ... means an arbitrary number of leading dimensions.
    If img is PIL Image, it is expected to be in mode "L" or "RGB".


Suggested change

If img is PIL Image, it is expected to be in mode "L" or "RGB".

If img is PIL Image, it is expected to be in mode "RGB".

Hi @datumbox ,

The below snippet still works when I pass a pillow image with the mode is 'L'. This definition of "work" here is that it does not raise the above issue. I've also checked some of the results and see that they are all good. If then, does it come from the tensor transformation side?

from PIL import Image from pathlib import Path import torch import torchvision from torchvision import transforms image_path = 'n02100877_4699.jpg' raw_image = Image.open(image_path).convert('L') # try with 'L', 'RGB print(image.size, image.mode) tf = transforms.AutoAugment(transforms.autoaugment.AutoAugmentPolicy.CIFAR10) for seed in range(0, 2000): torch.manual_seed(seed) image = tf(raw_image) image.save(out_dir / f"{seed}.png", 'png')

Here is the already setup colab notebook that you could manually check yourself: https://colab.research.google.com/drive/1WtlOAASAGJA927oeFECN2YSFAhlMJ6j2?usp=sharing

Hmm, thanks for investigating. It seems that PIL supports the colour operators even when the image is grayscale.

I think we have a few options:

Update the documentation for Tensors only since PIL supports L

Update the documentation for both to avoid confusing people

Update the implementations of colour transforms of TorchVision that don't support Grayscale images to align them with PIL.

I think option 3 is worth exploring but it will take longer and needs to happen carefully. So I would be in favour of doing 1 or 2 in this PR and follow up on 3 on a separate one. I don't have strong opinion over how we go, so let me know what you prefer.

Yeah, I think that this PR is just for updating the documentation only to keep it relates to the original issue. Furthermore, since we have already declared some policies such as ImageNet, CIFAR10 that their input is not a grayscale image. Thus, changing to support grayscale images would have some conflicts with the policy definitions as well that we might have further discussion in another thread.

In conclusion, in this PR, I would change the document so that this transformation only supports RGB for now.

datumbox

@VinhLoiIT LGTM, thanks a lot for the contribution and the detailed investigation.

Summary: * update docstring for RandAugment * update docstrings for pillow image Reviewed By: datumbox Differential Revision: D31268031 fbshipit-source-id: cc688541ff66b7ad78a2528b1c69424838be5971

update docstring for RandAugment

231054c

facebook-github-bot added the cla signed label Sep 22, 2021

datumbox reviewed Sep 22, 2021

View reviewed changes

datumbox added the module: documentation label Sep 22, 2021

update docstrings for pillow image

a414551

datumbox approved these changes Sep 22, 2021

View reviewed changes

datumbox merged commit 972ca65 into pytorch:main Sep 22, 2021

This was referenced Sep 22, 2021

Support grayscale image-like tensors for adjust_contrast and adjust_saturation functions #4466

Closed

Added gray image support to adjust_saturation function #4480

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update docstring for RandAugment #4457

Update docstring for RandAugment #4457

VinhLoiIT commented Sep 22, 2021 •

edited by datumbox

Loading

datumbox left a comment

datumbox Sep 22, 2021

VinhLoiIT Sep 22, 2021

VinhLoiIT Sep 22, 2021

datumbox Sep 22, 2021

VinhLoiIT Sep 22, 2021

datumbox left a comment

	If img is PIL Image, it is expected to be in mode "L" or "RGB".
	If img is PIL Image, it is expected to be in mode "RGB".

Update docstring for RandAugment #4457

Update docstring for RandAugment #4457

Conversation

VinhLoiIT commented Sep 22, 2021 • edited by datumbox Loading

datumbox left a comment

Choose a reason for hiding this comment

datumbox Sep 22, 2021

Choose a reason for hiding this comment

VinhLoiIT Sep 22, 2021

Choose a reason for hiding this comment

VinhLoiIT Sep 22, 2021

Choose a reason for hiding this comment

datumbox Sep 22, 2021

Choose a reason for hiding this comment

VinhLoiIT Sep 22, 2021

Choose a reason for hiding this comment

datumbox left a comment

Choose a reason for hiding this comment

VinhLoiIT commented Sep 22, 2021 •

edited by datumbox

Loading