Faster-R-CNN-with-model-pretrained-on-Visual-Genome

Faster RCNN model in Pytorch version, pretrained on the Visual Genome with ResNet-101

Introduction

we provide

Pretrained Faster RCNN model, which is trained with Visual Genome + ResNet-101 + Pytorch
Pytorch implementation of processing data tools, the Caffe version of which is provided by the 'bottom-up-attention'

Model

we use the same setting and benchmark as faster-rcnn.pytorch. The results of the model are shown below.

model	dataset	#GPUs	batch size	lr	lr_decay	max_epoch	mAP
Res-101	Visual Genome	1 1080 TI	4	1e-3	5	20	10.19

Download the pretrained model and put it to the folder $load_dir.

Utilization

Prerequisites

Python 3.6 or higher
Pytorch 1.0

Preparation

Clone the code

git clone https://github.com/shilrley6/Faster-R-CNN-with-model-pretrained-on-Visual-Genome.git

Pretrained image model

Download the pretrained VGG16 and ResNet101 models according to your requirement, which are provided by faster-rcnn.pytorch.

VGG16: Dropbox, VT Server
ResNet101: Dropbox, VT Server

Then put them into the path data/pretrained_model/.

Compilation

Install all the python dependencies using pip:

pip install -r requirements.txt

Compile the cuda dependencies using following simple commands:

cd lib
python setup.py build develop

Data processing

Generate tsv

Run genearte_tsv.py to extract features of image regions. The output file format will be a tsv, where the columns are ['image_id', 'image_w', 'image_h', 'num_boxes', 'boxes', 'features'].

python generate_tsv.py --net res101 --dataset vg \
                       --out $out_file --cuda

Use the parameter $load_dir (the path to the model, defult is '/models') to adapt your environment. Change the parameter $out_file to the path of the output file.

PS. If you download other pretrained models, you can rename the model as 'faster_rcnn_$net_$dataset.pth' and modify the parameter $net and $dataset.

Convert data

Run convert_data.py to convert the above output to a numpy array. The output file $output_file format will be a npy, including image region features.

python convert_data.py --imgid_list $imgid_list \
                       --input_file $input_file --output_file $output_file

The parameter $imgid_list is the path to a list of image id, in the format of '.txt'.

Demo

You can use this function to show object detections on demo images with a pre-trained model by running:

# python demo.py --net res101 --dataset vg \
                 --load_dir models --cuda

You can also add images to folder $image_dir and change the parameter $image_file to the filename.

PS. If you download other pretrained models, you can rename the model as 'faster_rcnn_$net_$dataset.pth' and modify the parameter $net and $dataset.

Acknowledgments

Thanks to 'bottom-up-attention' and faster-rcnn.pytorch.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
cfgs		cfgs
data/genome		data/genome
images		images
lib		lib
README.md		README.md
_init_paths.py		_init_paths.py
convert_data.py		convert_data.py
demo.py		demo.py
generate_tsv.py		generate_tsv.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Faster-R-CNN-with-model-pretrained-on-Visual-Genome

Introduction

Model

Utilization

Prerequisites

Preparation

Pretrained image model

Compilation

Data processing

Generate tsv

Convert data

Demo

Acknowledgments

About

Releases

Packages

Languages

ustcnewly/Faster-R-CNN-with-model-pretrained-on-Visual-Genome

Folders and files

Latest commit

History

Repository files navigation

Faster-R-CNN-with-model-pretrained-on-Visual-Genome

Introduction

Model

Utilization

Prerequisites

Preparation

Pretrained image model

Compilation

Data processing

Generate tsv

Convert data

Demo

Acknowledgments

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages