What is it?

This repository contains all the code for a diploma thesis in which we use object detection models trained on the Open Images Dataset to extract useful information from image content on the web.

Initial setup

To run the object detectors, we first need to download and generate the resources required by the inference pipelines. To keep the environment reproducible, we recommend using virtualenv.

Prerequisites

Python 3.6

Setting up the virtual environment

From the root folder:

pip install virtualenv  # if you don't have virtualenv installed yet
python -m virtualenv venv  # creates a virtual environment in the venv directory
source venv/bin/activate  # activates the virtual environment
pip install -r requirements_local.txt  # installs packages into the virtual environment
./set_python_path.sh  # IMPORTANT: all Python code assumes PYTHONPATH points to the root directory
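
Since all of the Python code assumes that PYTHONPATH points to the root directory, it can be useful to verify this before running anything. The following is a minimal sketch of such a check; the helper name `pythonpath_contains` is hypothetical and not part of this repository, and the check assumes you run it from the root folder.

```python
import os


def pythonpath_contains(directory, pythonpath=None):
    """Return True if `directory` is listed in PYTHONPATH.

    `pythonpath` defaults to the PYTHONPATH environment variable; it can be
    passed explicitly, which is handy for testing.
    """
    if pythonpath is None:
        pythonpath = os.environ.get("PYTHONPATH", "")
    target = os.path.realpath(directory)
    # PYTHONPATH uses the platform path separator (":" on Linux/macOS).
    entries = [entry for entry in pythonpath.split(os.pathsep) if entry]
    return any(os.path.realpath(entry) == target for entry in entries)


if __name__ == "__main__":
    repo_root = os.getcwd()  # assumption: executed from the repository root
    if pythonpath_contains(repo_root):
        print("PYTHONPATH includes the repository root")
    else:
        print("PYTHONPATH does not include the repository root")
```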

Setting up Faster R-CNN

Follow the instructions here.

Setting up YOLOv3

Follow the instructions here.

Experiments

See the /notebooks folder for the experiments.

Directory structure

The repository is organized into the following folders:

/models - code for object detection inference pipelines
/models/data - data structures used by object detection algorithms
/models/utils - utility functions used by object detection algorithms
/models/preprocessing - preprocessing used in the models
/models/yolov3 - YOLOv3 inference pipeline:
- /cpu_head - with inference head in NumPy (on CPU)
- /gpu_head_v1 - with inference head in TF, but non-max suppression in NumPy
- /gpu_head_v2 - with inference head and non-max suppression in TF
- /conversion - conversion from the Darknet framework to Keras
- /resources - resources needed by the model (e.g. weights, class labels)
/models/faster_rcnn_inception_resnet_v2_oid_v4 - Faster R-CNN inference pipeline
/evaluation - code for computing evaluation metrics for object detection models
/evaluation/average_precision - our custom implementation for computing (m)AP
/notebooks - Jupyter notebooks which:
- demonstrate how to use the object detectors
- contain scripts used for computing metrics
/utils - common utility functions
/common - shared code, e.g. custom argparse types
/scraper - scraper code used for collecting a dataset of annotatable URLs
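
For readers unfamiliar with the (m)AP metric mentioned above, the following is a minimal sketch of one common way to compute average precision (area under the interpolated precision-recall curve). It is not the implementation in /evaluation/average_precision; the function names and the input format (a list of `(score, is_true_positive)` pairs per class) are assumptions made for illustration.

```python
def average_precision(scored_matches, num_ground_truth):
    """Average precision for one class.

    scored_matches: list of (confidence, is_true_positive) pairs, one per detection.
    num_ground_truth: number of ground-truth boxes of this class.
    """
    if num_ground_truth == 0 or not scored_matches:
        return 0.0

    # Rank detections from most to least confident.
    ranked = sorted(scored_matches, key=lambda match: match[0], reverse=True)

    precisions, recalls = [], []
    true_positives = false_positives = 0
    for _, is_true_positive in ranked:
        if is_true_positive:
            true_positives += 1
        else:
            false_positives += 1
        precisions.append(true_positives / (true_positives + false_positives))
        recalls.append(true_positives / num_ground_truth)

    # Interpolate: precision at each rank becomes the best precision
    # achievable at that recall or any higher recall.
    for i in range(len(precisions) - 2, -1, -1):
        precisions[i] = max(precisions[i], precisions[i + 1])

    # Sum the area under the resulting step curve.
    area, previous_recall = 0.0, 0.0
    for precision, recall in zip(precisions, recalls):
        area += (recall - previous_recall) * precision
        previous_recall = recall
    return area


def mean_average_precision(per_class_aps):
    """mAP is the unweighted mean of the per-class AP values."""
    return sum(per_class_aps) / len(per_class_aps) if per_class_aps else 0.0
```

Deciding whether a detection counts as a true positive (typically by IoU against a ground-truth box) is left out of this sketch.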

Object detectors

Once the models are set up, you can use the object detectors as shown below.

YOLOv3

from models.yolov3.object_detector import ObjectDetector as YOLOv3ObjectDetector

yolov3_detector = YOLOv3ObjectDetector(
    # Be verbose about times for inference
    verbose=True,

    # Enable logging of the device placement
    log_device_placement=True,

    # Control probability threshold for detecting objects
    detection_threshold=0.3,

    # Control threshold for non-max suppression
    nms_threshold=0.6
)

target_path = ... # specify target file path to the image
bounding_boxes = yolov3_detector.infer_bounding_boxes_on_target_path(target_path)

# Alternatively, you can infer bounding boxes from an already loaded NumPy image (watch out for row-major vs. column-major ordering issues!)
np_image = ... # load an RGB image into a NumPy array in row-major ordering
bounding_boxes = yolov3_detector.infer_bounding_boxes_on_loaded_image(np_image)
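
To illustrate what the `detection_threshold` and `nms_threshold` parameters above typically control, here is a minimal pure-Python sketch of score filtering followed by greedy non-max suppression. It is not the code used by the YOLOv3 pipeline in this repository; the function names, the `(x_min, y_min, x_max, y_max)` box format, and the fact that class labels are ignored are all simplifying assumptions.

```python
def iou(box_a, box_b):
    """Intersection over union of two (x_min, y_min, x_max, y_max) boxes."""
    inter_width = max(0.0, min(box_a[2], box_b[2]) - max(box_a[0], box_b[0]))
    inter_height = max(0.0, min(box_a[3], box_b[3]) - max(box_a[1], box_b[1]))
    intersection = inter_width * inter_height
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - intersection
    return intersection / union if union > 0 else 0.0


def filter_detections(detections, detection_threshold=0.3, nms_threshold=0.6):
    """Keep confident, non-overlapping detections.

    detections: list of (box, score) pairs.
    Returns the surviving pairs, highest score first.
    """
    # Step 1: drop detections whose score is below the detection threshold.
    candidates = sorted(
        (d for d in detections if d[1] >= detection_threshold),
        key=lambda d: d[1],
        reverse=True,
    )
    # Step 2: greedy NMS. A box survives only if it does not overlap an
    # already kept (higher-scoring) box by more than the NMS threshold.
    kept = []
    for box, score in candidates:
        if all(iou(box, kept_box) <= nms_threshold for kept_box, _ in kept):
            kept.append((box, score))
    return kept
```

In this sketch, raising `detection_threshold` yields fewer but more confident boxes, while lowering `nms_threshold` suppresses overlapping boxes more aggressively.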

Faster R-CNN

from models.faster_rcnn_inception_resnet_v2_oid_v4.object_detector import ObjectDetector as FasterRCNNObjectDetector

faster_rcnn_detector = FasterRCNNObjectDetector(
    # Enable use of the GPU
    use_gpu=True,

    # Enable logging of the device placement
    log_device_placement=True
)

target_path = ... # specify target file path to the image
bounding_boxes = faster_rcnn_detector.infer_bounding_boxes_on_target_path(target_path)

# Alternatively, you can infer bounding boxes from an already loaded NumPy image (watch out for row-major vs. column-major ordering issues!)
np_image = ... # load an RGB image into a NumPy array in row-major ordering
bounding_boxes = faster_rcnn_detector.infer_bounding_boxes_on_loaded_image(np_image)
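
Both detectors accept an already loaded image, so the sketch below shows one possible way to produce such an array with Pillow and NumPy. The helper names are hypothetical, and the sketch assumes that "row-major" here means an array of shape (height, width, 3) stored in C order; check the detector code if your images come from a different source.

```python
import numpy as np


def to_row_major_rgb(array):
    """Validate an image array and return it in C (row-major) memory order.

    Expects shape (height, width, 3); raises ValueError otherwise.
    """
    array = np.asarray(array)
    if array.ndim != 3 or array.shape[2] != 3:
        raise ValueError("expected shape (height, width, 3), got %r" % (array.shape,))
    # Copies only if the array is not already C-contiguous.
    return np.ascontiguousarray(array)


def load_rgb_image(path):
    """Load an image file as a row-major RGB array using Pillow."""
    from PIL import Image  # imported lazily so the helper above works without Pillow

    with Image.open(path) as image:
        # convert("RGB") drops alpha channels and expands grayscale images.
        return to_row_major_rgb(image.convert("RGB"))
```

The result of `load_rgb_image(target_path)` could then be passed as `np_image` in the examples above.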

Technologies used

TensorFlow
Keras
Scrapy
Jupyter
Pillow

martin-galajda/object-detection
