Skip to content
This repository has been archived by the owner on Nov 1, 2024. It is now read-only.

Latest commit

 

History

History
49 lines (38 loc) · 964 Bytes

DATA.md

File metadata and controls

49 lines (38 loc) · 964 Bytes

Setting Up Data Paths

pycls finds datasets via symlinks from pycls/datasets/data to the actual locations where the dataset images and labels are stored. The instructions on how to create symlinks for ImageNet and CIFAR are given below.

Expected datasets structure for ImageNet:

imagenet
|_ train
|  |_ n01440764
|  |_ ...
|  |_ n15075141
|_ val
|  |_ n01440764
|  |_ ...
|  |_ n15075141
|_ ...

Expected datasets structure for CIFAR-10:

cifar10
|_ data_batch_1
|_ data_batch_2
|_ data_batch_3
|_ data_batch_4
|_ data_batch_5
|_ test_batch
|_ ...

Create a directory containing symlinks:

mkdir -p /path/pycls/pycls/datasets/data

Symlink ImageNet (/datasets01/imagenet_full_size/061417/ on FAIR cluster):

ln -sv /path/imagenet /path/pycls/pycls/datasets/data/imagenet

Symlink CIFAR-10 (/datasets01/cifar-10-batches-py/060817/ on FAIR cluster):

ln -sv /path/cifar10 /path/pycls/pycls/datasets/data/cifar10