Implement generic object detector #601

kloudkl · 2014-07-03T11:56:27Z

This provides a common interface for different candidate object region proposal algorithms. Together with #560, a pure C++ object detector consisting of a high-speed region selector, feature extractor, classifier and region merger would become a reality (#548).
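
As a rough illustration of what a common interface for region proposal algorithms could look like, here is a minimal sketch. The names `Rect`, `ROIGenerator`, `Generate` and `SlidingWindowROIGenerator` are assumptions made for this example (loosely following the commit titles in this PR), not the PR's actual classes or signatures, and a real proposal algorithm such as BING would take the image itself rather than only its size.

```cpp
#include <cassert>
#include <vector>

// Hypothetical axis-aligned rectangle covering [x1, x2) x [y1, y2).
// This is NOT the Rect class added by the PR.
struct Rect {
  int x1, y1, x2, y2;
};

// Hypothetical common interface: every proposal algorithm returns a list
// of candidate regions. Only the image size is passed to keep it short.
class ROIGenerator {
 public:
  virtual ~ROIGenerator() {}
  virtual std::vector<Rect> Generate(int image_height,
                                     int image_width) const = 0;
};

// Baseline implementation: dense sliding windows of one fixed size.
class SlidingWindowROIGenerator : public ROIGenerator {
 public:
  SlidingWindowROIGenerator(int window_height, int window_width, int stride)
      : window_height_(window_height),
        window_width_(window_width),
        stride_(stride) {
    assert(window_height > 0 && window_width > 0 && stride > 0);
  }

  std::vector<Rect> Generate(int image_height,
                             int image_width) const override {
    std::vector<Rect> rois;
    // Keep only windows that lie completely inside the image.
    for (int y = 0; y + window_height_ <= image_height; y += stride_) {
      for (int x = 0; x + window_width_ <= image_width; x += stride_) {
        rois.push_back(Rect{x, y, x + window_width_, y + window_height_});
      }
    }
    return rois;
  }

 private:
  int window_height_;
  int window_width_;
  int stride_;
};
```

With an interface of this shape, the feature extractor and the region merger only depend on `ROIGenerator`, so a different proposal method can be swapped in without touching the rest of the pipeline.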

@bhack, please have a look. Where is the latest BING ported to OpenCV?

bhack · 2014-07-03T12:42:20Z

@kloudkl https://github.com/fpuja/opencv_contrib/tree/saliencyModuleDevelop
There will also be video saliency region candidates from motion in the module before GSoC ends. It is a saliency API (static, objectness, motion).

cc: @fpuja @lenlen

kloudkl · 2014-07-04T03:34:39Z

Thanks for porting it!

Are you confident that your module will be merged into the official OpenCV repository? If so, when at the earliest? Until the API appears in an OpenCV release, I'm afraid it is only appropriate for use in one's internal applications. Anyway, this PR will finish a baseline object detector that can be easily extended to use more advanced algorithms in any phase of the pipeline.

bhack · 2014-07-04T08:00:47Z

It is not so easy to answer your question. In OpenCV 3.0, opencv-contrib will become an official part of the project. But I don't know what the package maintainers' policy will be in Linux distros.

@vpisarev can you give us some feedback on this outlook?

vpisarev · 2014-07-04T17:51:57Z

You are welcome to contribute new functionality to OpenCV, not to the main OpenCV repository but to the contrib repository, for which we now have more or less automatic testing: http://pullrequest.opencv.org/#/summary/contrib. The module must follow our (so far implicit) guidelines, i.e. have the same directory structure as other modules, use CMake as the build system, use RST/Sphinx for documentation, etc. There should also be some commitment to support the module for at least several months, i.e. you will be assigned all the bugs that users report. As soon as you submit a pull request, we can evaluate it.

bhack · 2014-07-04T18:20:04Z

@vpisarev I think that @kloudkl wants to know whether the OpenCV GSoC projects will be merged into opencv-contrib and whether opencv-contrib itself will have a new "official" distribution policy (tar.gz, distro packages, etc.).

vpisarev · 2014-07-04T18:27:13Z

All the GSoC results will be put into the opencv_contrib repository, following the same guidelines outlined above and also at http://code.opencv.org/projects/opencv/wiki/How_to_contribute.
There is no distribution policy that contributors need to care about. The whole of opencv_contrib can be downloaded as a .zip directly from GitHub and then built using the standard OpenCV build system. Binary packages for opencv_contrib will likely be prepared by the OpenCV 3.0 beta (around September this year). The Itseez team will take care of this, but we certainly appreciate any help. In any case, the opencv_contrib packaging issue will be solved as a whole; there is no need to invent something new for each contributed module.

kloudkl · 2014-07-05T08:28:06Z

@vpisarev, thanks for your detailed answers! Your explanations made me think that there could also be a caffe_contrib repository accompanying the main Caffe project in the future. Functionality that depends on additional third-party libraries, such as the BING module in OpenCV, could be placed there.

kloudkl · 2014-07-05T17:10:07Z

TODO:

Test the non-maximum suppression regions merger
Complete the generic CNN object detector including tests
Train and test an object detection model on a public dataset with example training and testing network proto
Performance evaluation
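
For readers unfamiliar with the first TODO item, the following is a minimal sketch of greedy non-maximum suppression as it is commonly done in detection pipelines. It is not the merger implemented in this PR: the `ScoredBox` struct, the function names and the intersection-over-union criterion are assumptions chosen for illustration.

```cpp
#include <algorithm>
#include <vector>

// Hypothetical scored detection box covering [x1, x2) x [y1, y2).
struct ScoredBox {
  float x1, y1, x2, y2, score;
};

inline float Area(const ScoredBox& b) {
  return std::max(0.0f, b.x2 - b.x1) * std::max(0.0f, b.y2 - b.y1);
}

// Intersection over union of two boxes; 0 when they do not overlap.
inline float IoU(const ScoredBox& a, const ScoredBox& b) {
  const float w = std::min(a.x2, b.x2) - std::max(a.x1, b.x1);
  const float h = std::min(a.y2, b.y2) - std::max(a.y1, b.y1);
  if (w <= 0.0f || h <= 0.0f) return 0.0f;
  const float intersection = w * h;
  return intersection / (Area(a) + Area(b) - intersection);
}

// Greedy NMS: visit boxes from highest to lowest score and keep a box only
// if it does not overlap an already kept box by more than iou_threshold.
std::vector<ScoredBox> NonMaximumSuppression(std::vector<ScoredBox> boxes,
                                             float iou_threshold) {
  std::sort(boxes.begin(), boxes.end(),
            [](const ScoredBox& a, const ScoredBox& b) {
              return a.score > b.score;
            });
  std::vector<ScoredBox> kept;
  for (const ScoredBox& candidate : boxes) {
    bool suppressed = false;
    for (const ScoredBox& winner : kept) {
      if (IoU(candidate, winner) > iou_threshold) {
        suppressed = true;
        break;
      }
    }
    if (!suppressed) kept.push_back(candidate);
  }
  return kept;
}
```

A test for such a merger would typically feed it two heavily overlapping boxes plus one distant box and check that only the higher-scoring overlapping box and the distant box survive.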

ronghanghu · 2014-07-05T17:19:04Z

@kloudkl It seems that this PR is aligned with Rectangular Pooling #614, so that we can implement a spatial pyramid pooling detector mentioned in http://arxiv.org/pdf/1406.4729v1.pdf

kloudkl · 2014-07-05T17:53:22Z

@ronghanghu, how are they aligned in your opinion? I would rather say Rectangular Pooling and #560 naturally work together.

ronghanghu · 2014-07-05T17:58:01Z

@kloudkl Yes, you are right. I mean rectangular pooling #614 can be used in spatial pyramid pooling #560, so that a spatial pyramid pooling detector can be implemented using this PR.

kloudkl · 2014-07-12T16:16:44Z

It is really very hard to manage a PR that depends on so many others. To build and test this PR, multiple PRs, including #560 and #558, have to be mixed together. After all the features are completely tested and benchmarked, the commits that belong to this PR will be cherry-picked.

kloudkl · 2014-07-12T17:34:39Z

In order to speed up extracting features for the thousands of regions that may contain objects, the spatial pyramid pooling layer directly pools non-square regions of the feature maps. The most difficult part of the problem is that the layers used by the SPP layer, i.e. the split, pooling, flatten and concat layers, must all support rectangular input blobs.
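
To make the idea concrete, here is a small sketch of one pyramid level pooling a rectangular region of a feature map into a fixed number of bins. It is an illustration only, not the layer code from #560: `PoolRegion` and its parameters are hypothetical, it handles a single channel, and it uses integer arithmetic that is equivalent to a fractional kernel size of `height / bins` by `width / bins`.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <limits>
#include <vector>

// Max-pools the region [y0, y1) x [x0, x1) of a single-channel, row-major
// feature map into bins x bins values. The output length depends only on
// `bins`, never on the region's size or aspect ratio.
std::vector<float> PoolRegion(const std::vector<float>& feature_map,
                              int map_width, int y0, int x0, int y1, int x1,
                              int bins) {
  assert(bins > 0 && map_width > 0);
  assert(0 <= y0 && y0 < y1 && 0 <= x0 && x0 < x1 && x1 <= map_width);
  assert(static_cast<std::size_t>(y1) * map_width <= feature_map.size());
  const int height = y1 - y0;
  const int width = x1 - x0;
  std::vector<float> pooled;
  pooled.reserve(static_cast<std::size_t>(bins) * bins);
  for (int by = 0; by < bins; ++by) {
    // Rows floor(by * height / bins) to ceil((by + 1) * height / bins).
    const int ys = y0 + (by * height) / bins;
    const int ye = y0 + ((by + 1) * height + bins - 1) / bins;
    for (int bx = 0; bx < bins; ++bx) {
      const int xs = x0 + (bx * width) / bins;
      const int xe = x0 + ((bx + 1) * width + bins - 1) / bins;
      float best = -std::numeric_limits<float>::infinity();
      for (int y = ys; y < ye; ++y) {
        for (int x = xs; x < xe; ++x) {
          best = std::max(best, feature_map[y * map_width + x]);
        }
      }
      pooled.push_back(best);
    }
  }
  return pooled;
}
```

Calling this for several values of `bins` and concatenating the results gives a fixed-length descriptor per region, which is why the feature maps only need to be computed once per image.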

bhack · 2014-07-16T21:27:47Z

You can follow the BING PR here: opencv/opencv_contrib#39

bhack · 2014-09-06T14:57:31Z

@kloudkl Bing objectness is merged now. You can find the docs here: http://docs.opencv.org/trunk/modules/saliency/doc/saliency.html

shelhamer · 2014-10-06T03:02:09Z

There are useful ideas and excerpts in this PR but the agglomeration is an uncomfortable mix of changes and concerns. Scope is at issue too: there are myriad ways to fit Caffe into a detector pipeline in one's own fork, but for inclusion in the main project it should be unobtrusive with respect to other tasks but generally useful for detection itself. My worry is that this is a somewhat individual effort for different kinds of detectors and best handled locally or by scripting, but that's merely my impression. Don't let that discourage proposals for detection pipeline PRs!

Closing for these reasons and because the originating fork was cancelled and the contributions in this branch are mostly either in-progress in other PRs or out-of-date.

kloudkl changed the title ~~Implement regions of interest generator for object detection~~ Implement generic object detector Jul 5, 2014

ronghanghu mentioned this pull request Jul 8, 2014

Implement SpatialPyramidPoolingLayer with the Split, Pooling, Flatten & Concat layers #560

Closed

kloudkl added 16 commits July 13, 2014 00:48

Implement regions of interest generator for object detection

997ecf2

Add Rect::area, intersect and empty methods

ae3b0b6

Refine the ROI generator API

2bc0d00

Add object detection module headers in caffe.hpp

810ec63

Add tool script to generate window data file in cpp

fed542b

Implement non-maximum suppression regions merger

80a5e8f

Implement the generic convolutional neural network object detector

04f57c3

Add object_detectors and regions_merger headers in caffe.hpp

e6d43a0

Define the ROIGenerator, RegionsMerger and ObjectDetector parameters

54a3ea4

Add more tests of the sliding window roi generator and nms merger

d547909

Blob can copy from a region of another blob

187924f

Composite SpatialPyramidPooling with Split, Pooling, Flatten & Concat

cb20c8f

Pooling layer allows float heights and widths for kernels and strides

4306c21

Support non-square float kernel and stride in spatial pyramid pooling

1bd63ef

Allow finetune_net to specify CPU or GPU mode and device id

e6a6466

Add more spatial bins to test the SpatialPyramidPoolingLayer

90169fe

kloudkl added 17 commits July 13, 2014 00:48

Test the pooling layers with float kernels and strides

468f18b

Test the spatial pyramid pooling layer with float kernels and strides

c14b29a

Simplify the verbose assignment & comparison in pooling layer tests

ea82979

Improve computing the pooled height and width in the pooling layer

2dfa979

Avoid the verbose assignment & comparison in spatial pooling layer tests

11edfc1

Add example network definitions using the spatial pyramid pooling layer

8f44a1b

Compute more accurate average pooling sizes in the PoolingLayer

ce94a98

Add spatial pyramid pooling in the directory name of examples/voc2012

5c65083

Feature extractor can get the keys of the samples from a text file

f1eccab

Reserve the shape when the extracted feature blobs are saved

dbd2372

Update the window data file generating script

3abb010

Name GenericCNNObjectDetector as SpatialPyramidPoolingNetObjectDetector

e3aa60c

Change snapshot prefix of the voc2012-spatial-pyramid-pooling example

58b2653

Fix the prefetch_rng bug in various data layers by @sguada

f0d5085

Add phase getter and setter & set the phase in tests for the data layer

c99b385

Implement the SpatialPyramidPoolingDataLayer to train and test detector

6aca1bb

Add example files to extract features, train & test spp-net detector

10daed4

This was referenced Aug 9, 2014

Multi label Data and MultiLabel Accuracy #523

Closed

State of the art Pascal VOC Multilabel #898

Closed

shelhamer force-pushed the dev branch 3 times, most recently from 4278286 to c01f07a Compare August 28, 2014 07:00

shelhamer force-pushed the dev branch from 64258b6 to 403b56b Compare September 19, 2014 04:38

shelhamer closed this Oct 6, 2014
