We provide guidance for preparing the data used by DesCo. Note that not all data are needed for every experiment; please check the Required Data fields in the README to download only the necessary data. All data should be placed under the DATASET folder.
Most data preparation is similar to that of GLIP/FIBER. However, for training on Flickr30K and GoldG with negative captions, we need final_flickr_separateGT_train_gpt.0425.json and final_mixed_train_no_coco_with_nouns_gpt.0425.json.
Download the original COCO data into the DATASET/coco folder. The contents should be organized as follows (a small extraction sketch is given after the listing):
DATASET/coco/train2017
DATASET/coco/annotations/instances_train2017.json
DATASET/coco/val2017
DATASET/coco/annotations/instances_val2017.json
DATASET/coco/test2017
DATASET/coco/annotations/image_info_test-dev2017.json
DATASET/coco/train2014
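If you fetched the official COCO zip archives, a minimal extraction sketch in Python is given below. It assumes the archives (train2017.zip, val2017.zip, test2017.zip, train2014.zip, annotations_trainval2017.zip, image_info_test2017.zip, as named on the official COCO download page) have already been downloaded into DATASET/coco; their internal folder names already match the layout above, so plain extraction is enough.

import os
import zipfile

coco_root = "DATASET/coco"
# Official COCO archives, assumed to have been downloaded into DATASET/coco already.
archives = [
    "train2017.zip", "val2017.zip", "test2017.zip", "train2014.zip",
    "annotations_trainval2017.zip", "image_info_test2017.zip",
]
for name in archives:
    zip_path = os.path.join(coco_root, name)
    if not os.path.exists(zip_path):
        print(f"missing {zip_path}, skipping")
        continue
    with zipfile.ZipFile(zip_path) as zf:
        # Unpacks to train2017/, val2017/, test2017/, train2014/ and annotations/.
        zf.extractall(coco_root)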
LVIS uses the same images as COCO, so prepare the COCO images under DATASET/coco first.
Download the following annotation files:
wget https://huggingface.co/harold/DesCo/blob/main/lvis_od_train.json -O DATASET/coco/annotations/lvis_od_train.json
wget https://huggingface.co/harold/DesCo/blob/main/lvis_v1_minival_inserted_image_name.json -O DATASET/coco/annotations/lvis_v1_minival_inserted_image_name.json
wget https://huggingface.co/harold/DesCo/blob/main/lvis_od_val.json -O DATASET/coco/annotations/lvis_od_val.json
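Note that wget can silently save an HTML page instead of the actual file if a link points at a file viewer rather than the raw file, so it is worth confirming the downloads parse as JSON. A minimal check, assuming python3 is available (the "images" field is standard in LVIS/COCO-style annotations):

import json

for path in [
    "DATASET/coco/annotations/lvis_od_train.json",
    "DATASET/coco/annotations/lvis_v1_minival_inserted_image_name.json",
    "DATASET/coco/annotations/lvis_od_val.json",
]:
    # A broken download usually fails here with a JSON decode error.
    with open(path) as f:
        data = json.load(f)
    print(path, "->", len(data.get("images", [])), "images")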
We store the Objects365 data in the TSV format; please see the link for a description of this format. We provide the following annotation files:
wget https://huggingface.co/harold/DesCo/blob/main/objects365_train_vgoiv6.cas2000.yaml -O DATASET/Objects365/objects365_train_vgoiv6.cas2000.yaml
wget https://huggingface.co/harold/DesCo/blob/main/train.label.tsv -O DATASET/Objects365/train.label.tsv
wget https://huggingface.co/harold/DesCo/blob/main/train.label.linelist.cas.2000.tsv -O DATASET/Objects365/train.label.linelist.cas.2000.tsv
wget https://huggingface.co/harold/DesCo/blob/main/train.label.lineidx -O DATASET/Objects365/train.label.lineidx
wget https://huggingface.co/harold/DesCo/blob/main/train.hw.tsv -O DATASET/Objects365/train.hw.tsv
wget https://huggingface.co/harold/DesCo/blob/main/train.hw.lineidx -O DATASET/Objects365/train.hw.lineidx
wget https://huggingface.co/harold/DesCo/blob/main/object365_vgoiv6_class2ind.json -O DATASET/Objects365/object365_vgoiv6_class2ind.json
We cannot host the image data. Please download the original image data and organize it into DATASET/Objects365/images.tsv and DATASET/Objects365/images.lineidx (a packing sketch follows).
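For reference, below is a minimal packing sketch. It assumes the GLIP-style TSV convention described at the link above: each row of images.tsv is an image key and the base64-encoded image bytes, separated by a tab, and images.lineidx stores the byte offset at which each row starts. The source image folder is a hypothetical placeholder, and the keys must line up with those used in train.label.tsv.

import base64
import os

image_dir = "objects365_raw_images"  # hypothetical folder holding the downloaded images
tsv_path = "DATASET/Objects365/images.tsv"
lineidx_path = "DATASET/Objects365/images.lineidx"

os.makedirs(os.path.dirname(tsv_path), exist_ok=True)
with open(tsv_path, "wb") as tsv, open(lineidx_path, "w") as lineidx:
    for name in sorted(os.listdir(image_dir)):
        if not name.lower().endswith((".jpg", ".jpeg", ".png")):
            continue
        with open(os.path.join(image_dir, name), "rb") as f:
            encoded = base64.b64encode(f.read())
        key = os.path.splitext(name)[0]   # image key; must match the annotation TSVs
        lineidx.write(f"{tsv.tell()}\n")  # byte offset of the row about to be written
        tsv.write(key.encode("utf-8") + b"\t" + encoded + b"\n")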
Download the Flickr30K images from Link and put them under DATASET/flickr30k/flickr30k_images/. Download the MDETR annotations and put them under DATASET/mdetr_annotations/.
Additionally, download the annotation file with generated negative captions final_flickr_separateGT_train_gpt.0425.json.
The dataset structure should look like:
DATASET/flickr30k/flickr30k_images/
DATASET/mdetr_annotations/final_flickr_separateGT_*
DATASET/mdetr_annotations/final_flickr_separateGT_train_gpt.0425.json
GoldG is the grounding dataset curated by MDETR.
Please prepare the COCO train2014 data and put them under DATASET/coco/train2014.
Prepare the GQA images and put them under DATASET/gqa/images/.
Then download the annotation files. The original MDETR annotation file contains COCO images; we provide a version without COCO images:
wget https://huggingface.co/harold/DesCo/blob/main/final_mixed_train_no_coco.json -O DATASET/mdetr_annotations/final_mixed_train_no_coco.json
Additionally, download the annotation file with generated negative captions final_mixed_train_no_coco_with_nouns_gpt.0425.json.
The dataset structure should look like:
DATASET/coco/train2014
DATASET/gqa/images
DATASET/mdetr_annotations/final_mixed_train_no_coco.json
DATASET/mdetr_annotations/final_mixed_train_no_coco_with_nouns_gpt.0425.json
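Before launching training, a quick way to confirm the Flickr30K and mixed-grounding layout above is in place is to check that every expected path exists; this sketch only tests for presence and makes no assumptions about the file contents.

import os

expected = [
    "DATASET/flickr30k/flickr30k_images",
    "DATASET/mdetr_annotations/final_flickr_separateGT_train_gpt.0425.json",
    "DATASET/coco/train2014",
    "DATASET/gqa/images",
    "DATASET/mdetr_annotations/final_mixed_train_no_coco.json",
    "DATASET/mdetr_annotations/final_mixed_train_no_coco_with_nouns_gpt.0425.json",
]
for path in expected:
    status = "ok" if os.path.exists(path) else "MISSING"
    print(f"{status:8s}{path}")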