This repository is about the code implementation of paper-Visual relationship detection with region topology structure. Here, we are very grateful for the public code provided by Liang et al. vrd-dsr The work of this paper is closely related to this.
ubuntu 16.04
pytorch 1.2.0 +
python 3.6.5
torch_geometric 1.1+
albumentations 0.3+
tabulate 0.8.0+
mkdir data/cache # caching file
mkdir data/pretrained_model # model checkpoint
mkdir data/pretrained_model/vg
mkdir data/pretrained_model/vrd
mkdir data/vrd # vrd dataset file
mkdir data/vg # vg dataset file
mkdir experiment # print result of each session
mkdir logs # logs file
mkdir models # resnet_101 pretrained weight file
vrd dataset
Download from vrd dataset
unzip file
cd data/vrd/
cp -r your/vrd/download/path/* ./
vg dataset
Download from vg dataset
unzip file
cd data/vg/
cp -r your/vg/download/path/* ./
Please download vg images from vg images
merge images to a file
next build soft link
ln -s your/vg/images/path ./images
vrd weight
Download from vrd weight
cd data/pretrained_model/vrd/
cp epoch_4_session_4_vrd_graph_rel.pth ./
vg weight
Download from vg weight
cd data/pretrained_model/vg/
cp epoch_7_session_5_vg_graph_rel.pth ./
First, you should load resnet_101 pretrained weight on imagenet.
Download resnet_101 pretrained_weight
cp your/resnet_101_wegint_path/resnet101-5d3b4d8f.pth models/
In the training phase, after training an epoch, we will conduct a Pre Det task evaluation.
cd tools
python train_graph.py --dataset vrd --name VRD_RANK --session 4 --device gpu --epochs 5
note:lr_step = [3, ]
python train_graph.py --dataset vg --name VRD_RANK --session 5 --device gpu --epochs 12
note: lr_step = [8, 11]
cd tools
for example: vrd dataset
python test_graph.py --ds_name vrd --device gpu--model_type Faster-RCNN --proposal ../data/faster-rcnn-detection/x101_test_proposals_20.pkl --resume ../data/pretrained_model/vrd/epoch_4_session_4_vrd_graph_rel.pth
If you use this code, please cite the following paper(s):
@article{vrd_graph,
title={Visual relationship detection with region topology structure},
author={Le Zhang,Ying Wang,HaiShun Chen,Jie Li,ZhenXi Zhang},
year={2020}
}