
[POC] Base class for dataset tests #3402

Merged
merged 30 commits into from
Feb 18, 2021

Conversation

Collaborator

@pmeier pmeier commented Feb 15, 2021

Proof-of-concept for #3375.

We provide the fully-documented DatasetTestCase class that needs to be subclassed for each individual dataset. In most cases the user only has to configure the DATASET_CLASS to indicate which dataset should be tested and supply fake data within the inject_fake_data method. I've added tests for Caltech256, CIFAR10, and CIFAR100 to showcase this.

The user can add additional CONFIGS, all of which are tested instead of only the default configuration. This can be used to check that the dataset works as intended for different splits or other parameter combinations.
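For illustration, the pattern can be sketched with a toy dataset. The names (`DATASET_CLASS`, `CONFIGS`, `inject_fake_data`) mirror the PR, but the base class below is a heavily simplified stand-in, not the actual implementation in `test/datasets_utils.py`:

```python
import tempfile


class FakeDataset:
    """Stand-in for a torchvision dataset; a real one would read files from root."""

    def __init__(self, root, split="train"):
        self.root = root
        self.split = split

    def __len__(self):
        return 3


class DatasetTestCase:
    """Greatly simplified sketch of the proposed base class."""

    DATASET_CLASS = None
    CONFIGS = None

    def inject_fake_data(self, tmpdir, config):
        raise NotImplementedError

    def run_smoke_tests(self):
        results = []
        for config in self.CONFIGS or [dict()]:
            with tempfile.TemporaryDirectory() as tmpdir:
                # the subclass creates fake files and reports what it
                # created through the info dict
                info = self.inject_fake_data(tmpdir, config)
                dataset = self.DATASET_CLASS(tmpdir, **config)
                results.append(len(dataset) == info["num_examples"])
        return results


class FakeDatasetTestCase(DatasetTestCase):
    DATASET_CLASS = FakeDataset
    CONFIGS = [dict(split="train"), dict(split="test")]

    def inject_fake_data(self, tmpdir, config):
        # a real subclass would write fake images for the given split here
        return dict(num_examples=3)


print(FakeDatasetTestCase().run_smoke_tests())  # [True, True]
```

Each configuration gets a fresh temporary directory, so tests cannot leak fake data into each other.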


Two things we need to discuss:

  1. Some datasets require additional arguments without default values. For example

    def __init__(self, root, annotation_path, frames_per_clip, step_between_clips=1,

    def __init__(
        self,
        root: str,
        annFile: str,
    We need an interface for the user to be able to provide them. My idea was to put this into inject_fake_data. In addition to the info dict, it could return a sequence of arguments passed to the constructor. If not returned, we could simply assume that only root should be passed, making this optional.

  2. As of now, no tests for the (target_)?transforms are implemented. Writing the tests is trivial, but we need a way to indicate which transform is applied to which output argument. Most of the time transform is applied to the first output, target_transform to the second, and transforms to the first two. But there are exceptions to this:

    if self.transform is not None:
        data1 = self.transform(data1)
        data2 = self.transform(data2)
    return data1, data2, m[2]

    if self.transform is not None:
        video = self.transform(video)
    return video, audio, label

    My idea is to provide TRANSFORM_IDCS and TARGET_TRANSFORM_IDCS class attributes (TRANSFORMS_IDCS is not necessary, since it can be derived from the other two). These would be mandatory, but could have good default values in (Image|Video)DatasetTestCase.
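The interface proposed in point 1 could look roughly like this; `resolve_ctor_args` is a hypothetical helper for illustration, not part of the PR:

```python
def resolve_ctor_args(inject_result, tmpdir):
    """If inject_fake_data returns an (args, info) pair, pass args to the
    dataset constructor; if it returns only the info dict, assume the
    dataset just takes root."""
    if isinstance(inject_result, tuple):
        args, info = inject_result
    else:
        args, info = (tmpdir,), inject_result
    return args, info


# dataset that only needs root: inject_fake_data returns the info dict alone
args, info = resolve_ctor_args(dict(num_examples=5), "/tmp/root")
print(args)  # ('/tmp/root',)

# dataset like UCF101 that also needs an annotation path and
# frames_per_clip: inject_fake_data returns (args, info)
args, info = resolve_ctor_args(
    (("/tmp/videos", "/tmp/annotations", 5), dict(num_examples=5)),
    "/tmp/root",
)
print(args)  # ('/tmp/videos', '/tmp/annotations', 5)
```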

@pmeier pmeier force-pushed the improve-datasets-testing branch from 11773da to e85f996 Compare February 16, 2021 12:09
Collaborator Author

pmeier commented Feb 16, 2021

I suspect the tests fail on Windows because img = PIL.Image.open() only loads the image lazily. If I understand the documentation correctly, the user should call img.load(), or any method that accesses the pixel data, after opening. After that, the file the data was loaded from can be deleted.
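A minimal reproduction of the described fix (assumes Pillow is installed; the delete would only fail on Windows if the handle were still open):

```python
import os
import tempfile

from PIL import Image

# create a small image file in a temporary directory
tmpdir = tempfile.mkdtemp()
path = os.path.join(tmpdir, "img.png")
Image.new("RGB", (4, 4)).save(path)

img = Image.open(path)  # lazy: the file stays open, pixels are not read yet
img.load()              # read the pixel data; PIL closes the file afterwards

# on Windows, deleting a file that still has an open handle raises an
# error; after img.load() the removal is safe on all platforms
os.remove(path)
print(img.size)  # (4, 4)
```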

@codecov

codecov bot commented Feb 16, 2021

Codecov Report

Merging #3402 (3315ae5) into master (c1f85d3) will increase coverage by 0.30%.
The diff coverage is n/a.


@@            Coverage Diff             @@
##           master    #3402      +/-   ##
==========================================
+ Coverage   74.83%   75.13%   +0.30%     
==========================================
  Files         105      105              
  Lines        9722     9722              
  Branches     1563     1563              
==========================================
+ Hits         7275     7305      +30     
+ Misses       1960     1930      -30     
  Partials      487      487              
Impacted Files                    Coverage Δ
torchvision/datasets/vision.py    65.51% <0.00%> (+1.72%)  ⬆️
torchvision/datasets/cifar.py     90.00% <0.00%> (+13.75%) ⬆️
torchvision/datasets/caltech.py   42.52% <0.00%> (+20.68%) ⬆️


Member

@fmassa fmassa left a comment

This looks great!

I have a few comments, especially about the handling of PIL-opened images on Windows; let me know what you think.

About your questions:

1 - I might be missing something here, but why not leave those to be available in the config?

2 - Let's leave transform testing out for now. There might be a way to test that the transform is called without having to let the user specify it. Maybe by creating dummy transformer classes that return a specific dummy value, and checking the presence of this dummy value in the returned dataset.
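One way the sentinel idea could be realized (all names below are made up for illustration): a dummy transform returns a unique object, and the test only checks that this object appears somewhere in the returned example.

```python
SENTINEL = object()


def sentinel_transform(_):
    """Dummy transform that swallows its input and returns a unique sentinel."""
    return SENTINEL


class PairDataset:
    """Toy dataset returning (image, target) pairs, mimicking the usual
    torchvision convention."""

    def __init__(self, transform=None, target_transform=None):
        self.transform = transform
        self.target_transform = target_transform

    def __getitem__(self, index):
        image, target = "image", "target"
        if self.transform is not None:
            image = self.transform(image)
        if self.target_transform is not None:
            target = self.target_transform(target)
        return image, target


example = PairDataset(transform=sentinel_transform)[0]
# the sentinel appears somewhere in the output, so the transform was
# called, without the test needing to know which field it applies to
print(any(item is SENTINEL for item in example))  # True
```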

@pmeier
Collaborator Author

pmeier commented Feb 17, 2021

1 - I might be missing something here, but why not leave those to be available in the config?

I think that would only be reasonable if there is a good and independent default value:

def __init__(self, root: str, split: str, **kwargs: Any) -> None:

The datasets I mentioned require an additional directory for which these assumptions cannot be met. The annotation directory should be inside a temporary directory, which is only created shortly before inject_fake_data() runs. Furthermore, the additional directory should not be inside root. For example, UCF101 collects the available classes by listing the contents of root:

classes = list(sorted(list_dir(root)))

Thus, if the annotation directory is inside root, it will be recognized as a class.

In most cases the temporary directory passed to inject_fake_data can be treated as root, but as shown above there are exceptions. For UCF101 the args should be something like root="/tmp/videos", annotation_root="/tmp/annotations".
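The UCF101 pitfall can be reproduced with plain os.listdir (list_dir in torchvision essentially lists the directories inside root):

```python
import os
import tempfile

root = tempfile.mkdtemp()
for name in ("ApplyEyeMakeup", "Archery"):
    os.makedirs(os.path.join(root, name))

# putting the annotation folder inside root...
os.makedirs(os.path.join(root, "annotations"))

# mimic the class discovery: every directory inside root counts as a class
classes = sorted(
    entry
    for entry in os.listdir(root)
    if os.path.isdir(os.path.join(root, entry))
)
# ...makes it show up as a bogus class
print(classes)  # ['ApplyEyeMakeup', 'Archery', 'annotations']
```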


2 - Let's leave transform testing out for now. There might be a way to test that the transform is called without having to let the user specify it. Maybe by creating dummy transformer classes that return a specific dummy value, and checking the presence of this dummy value in the returned dataset.

So instead of checking whether the returned values are correct, you only want to check that the (target_)?transforms were called when accessing an example? That is not hard to implement. Note that this can only check that the (target_)?transforms were called, not what they were called with. For the latter we have the same issue as before, since without user intervention we can't know which arguments should be passed to the transform.

@pmeier pmeier requested a review from fmassa February 17, 2021 13:40
Member

@fmassa fmassa left a comment
LGTM, thanks!

@pmeier pmeier changed the base branch from tests-datasets to master February 17, 2021 16:16
@pmeier pmeier requested a review from fmassa February 17, 2021 17:04
@fmassa fmassa merged commit 22c548b into pytorch:master Feb 18, 2021
@pmeier pmeier deleted the improve-datasets-testing branch February 18, 2021 09:29
facebook-github-bot pushed a commit that referenced this pull request Feb 23, 2021
Summary:
* add base class for datasets tests

* add better type hints

* add documentation to subclasses

* add utility functions to create files / folders of random images and videos

* fix imports

* remove class properties

* fix smoke test

* fix type hints

* fix random size generation

* add Caltech256 as example

* add utility function to create grid of combinations

* add CIFAR100? as example

* lint

* add missing import

* improve documentation

* create 1 frame videos by default

* remove obsolete check

* return path of files created with utility functions

* [test] close PIL file handles before deletion

* fix video folder creation

* generalize file handle closing

* fix lazy imports

* add test for transforms

* fix explanation comment

* lint

* force load opened PIL images

* lint

* copy default config to avoid inplace modification

* enable additional arg forwarding

Reviewed By: NicolasHug

Differential Revision: D26605320

fbshipit-source-id: c54903ea5a6b57ddfa57c560e579e0834cf0cc11