This repository has been archived by the owner on Oct 31, 2023. It is now read-only.

agnostic-regression for bbox #390

Merged: 21 commits, merged Feb 4, 2019

Conversation

zimenglan-sysu-512 (Contributor)

No description provided.

@facebook-github-bot

Thank you for your pull request and welcome to our community. We require contributors to sign our Contributor License Agreement, and we don't seem to have you on file. In order for us to review and merge your code, please sign up at https://code.facebook.com/cla. If you are contributing on behalf of someone else (eg your employer), the individual CLA may not be sufficient and your employer may need the corporate CLA signed.

If you have received this in error or have any questions, please contact us at cla@fb.com. Thanks!

@zimenglan-sysu-512 (Contributor, Author)

related to #111

@fmassa (Contributor)

fmassa commented Jan 29, 2019

This is awesome, thanks a lot!

I started writing some comments, but then I realized that I needed to think through a few things. I'll try to review it fully by the end of the week.

Also, did you launch trainings with class-agnostic box regression (to compare against Detectron)?

@fmassa fmassa self-requested a review January 29, 2019 20:19
@zimenglan-sysu-512 (Contributor, Author)

zimenglan-sysu-512 commented Jan 31, 2019

hi @fmassa
I ran e2e_mask_rcnn_R_50_FPN_1x with class-agnostic regression and OHEM (which I forgot to turn off, sorry for that; POSITIVE_FRACTION was set to 0.25), and got these results:

bbox
Average Precision  (AP) @[ IoU=0.50:0.95 | area=   all | maxDets=100 ] = 0.380
Average Precision  (AP) @[ IoU=0.50      | area=   all | maxDets=100 ] = 0.593
Average Precision  (AP) @[ IoU=0.75      | area=   all | maxDets=100 ] = 0.414
Average Precision  (AP) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.214
Average Precision  (AP) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.410
Average Precision  (AP) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.501
Average Recall     (AR) @[ IoU=0.50:0.95 | area=   all | maxDets=  1 ] = 0.313
Average Recall     (AR) @[ IoU=0.50:0.95 | area=   all | maxDets= 10 ] = 0.492
Average Recall     (AR) @[ IoU=0.50:0.95 | area=   all | maxDets=100 ] = 0.516
Average Recall     (AR) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.324
Average Recall     (AR) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.550
Average Recall     (AR) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.661

mask
Average Precision  (AP) @[ IoU=0.50:0.95 | area=   all | maxDets=100 ] = 0.343
Average Precision  (AP) @[ IoU=0.50      | area=   all | maxDets=100 ] = 0.558
Average Precision  (AP) @[ IoU=0.75      | area=   all | maxDets=100 ] = 0.363
Average Precision  (AP) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.151
Average Precision  (AP) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.367
Average Precision  (AP) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.510
Average Recall     (AR) @[ IoU=0.50:0.95 | area=   all | maxDets=  1 ] = 0.294
Average Recall     (AR) @[ IoU=0.50:0.95 | area=   all | maxDets= 10 ] = 0.447
Average Recall     (AR) @[ IoU=0.50:0.95 | area=   all | maxDets=100 ] = 0.467
Average Recall     (AR) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.264
Average Recall     (AR) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.502
Average Recall     (AR) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.626

It seems OHEM helps a little. (I also trained the same setting without class-agnostic regression and got 37.9/34.4.) I think class-agnostic regression works.
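
For reference, the switch under discussion is the MODEL.CLS_AGNOSTIC_BBOX_REG config key (visible in the diff further down); a minimal yaml override to enable it for a run like the one above would presumably look like this (assuming the usual maskrcnn-benchmark config layout):

```yaml
MODEL:
  CLS_AGNOSTIC_BBOX_REG: True
```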


@fmassa (Contributor) left a comment


This looks good to me, thanks a lot!

I have only one question, but I think this can be merged already; I'd just like to discuss it a bit before merging.

@@ -14,7 +14,8 @@ def __init__(self, config, pretrained=None):
num_classes = config.MODEL.ROI_BOX_HEAD.NUM_CLASSES
self.avgpool = nn.AvgPool2d(kernel_size=7, stride=7)
self.cls_score = nn.Linear(num_inputs, num_classes)
self.bbox_pred = nn.Linear(num_inputs, num_classes * 4)
num_bbox_reg_classes = 2 if cfg.MODEL.CLS_AGNOSTIC_BBOX_REG else num_classes
@fmassa commented on this change:

this is how Detectron implements it: by using 0 for background and 1 for the other classes.

But I'm starting to wonder if this is actually necessary here: given that we index out all negative examples via sampled_pos_inds_subset, we don't backprop through the background elements anyway.
Given that, I think we could have only num_bbox_reg_classes = 1, and the rest of the code should work as well.

@rbgirshick what do you think? Am I missing something here?
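
A minimal sketch of the predictor logic being discussed (a hypothetical standalone module, not the actual maskrcnn-benchmark class): with class-agnostic regression the box head emits 2 * 4 regression outputs (background + a single "any object" class) instead of num_classes * 4.

```python
import torch
from torch import nn

class BoxPredictor(nn.Module):
    """Toy stand-in for the ROI box predictor discussed above."""

    def __init__(self, num_inputs, num_classes, cls_agnostic_bbox_reg):
        super().__init__()
        self.cls_score = nn.Linear(num_inputs, num_classes)
        # Class-agnostic: one background slot + one shared foreground slot.
        num_bbox_reg_classes = 2 if cls_agnostic_bbox_reg else num_classes
        self.bbox_pred = nn.Linear(num_inputs, num_bbox_reg_classes * 4)

    def forward(self, x):
        return self.cls_score(x), self.bbox_pred(x)

# Per-class head would predict 81 * 4 deltas; agnostic head only 2 * 4.
feat = torch.randn(5, 1024)
scores, deltas = BoxPredictor(1024, 81, cls_agnostic_bbox_reg=True)(feat)
print(deltas.shape)  # torch.Size([5, 8])
```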

@fmassa (Contributor)

fmassa commented Feb 4, 2019

After a second thought: we still use the +1 background class for box regression elsewhere, so in order to maintain consistency I'll keep the same here.

Thanks a lot!

@fmassa fmassa merged commit 9cb251d into facebookresearch:master Feb 4, 2019
@zimenglan-sysu-512 (Contributor, Author)

zimenglan-sysu-512 commented Feb 10, 2019

hi @fmassa
After merging the master branch, I re-ran the e2e_mask_rcnn_R_50_FPN_1x experiment without OHEM and got 37.6/34.1.

@fmassa (Contributor)

fmassa commented Feb 11, 2019

Hi @zimenglan-sysu-512 ,

Thanks for the information; those numbers look good to me!

@xxradon

xxradon commented Apr 3, 2019

@zimenglan-sysu-512 I noticed you have implemented OHEM on maskrcnn-benchmark. Can you share your code or give us some suggestions on how to implement it? Thank you very much.

@zimenglan-sysu-512 (Contributor, Author)

zimenglan-sysu-512 commented Apr 3, 2019

hi @xxradon
Here is my implementation, in these lines:

        # https://pytorch.org/docs/stable/_modules/torch/nn/functional.html#cross_entropy
        device = class_logits.device  # used below for the index tensors
        reduction = "mean"  # default for 'F.cross_entropy'
        if self.reduction == "none":
            reduction = self.reduction
        classification_loss = F.cross_entropy(
            class_logits, labels, reduction=reduction
        )

        # get indices that correspond to the regression targets for
        # the corresponding ground truth labels, to be used with
        # advanced indexing
        sampled_pos_inds_subset = torch.nonzero(labels > 0).squeeze(1)
        labels_pos = labels[sampled_pos_inds_subset]
        if self.cls_agnostic_bbox_reg:
            map_inds = torch.tensor([4, 5, 6, 7], device=device)
        else:
            map_inds = 4 * labels_pos[:, None] + torch.tensor(
                [0, 1, 2, 3], device=device)

        reduction = self.reduction
        box_loss = smooth_l1_loss(
            box_regression[sampled_pos_inds_subset[:, None], map_inds],
            regression_targets[sampled_pos_inds_subset],
            reduction=reduction,
            beta=1,
        )
        if self.reduction in ['sum',]:
            box_loss = box_loss / labels.numel()

        if self.reduction == 'none':
            num_props = classification_loss.size()[0]
            box_loss_paste = torch.zeros((num_props,), device=device)
            box_loss_paste[sampled_pos_inds_subset] = box_loss
            box_loss = box_loss_paste

        return classification_loss, box_loss
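
As a toy illustration of the advanced-indexing step above (made-up shapes: 3 RoIs, 3 classes): box_regression stores one 4-vector per class per RoI, and map_inds selects the four columns belonging to each positive RoI's ground-truth class.

```python
import torch

# Made-up shapes: 3 RoIs, 3 classes (class 0 = background).
num_rois, num_classes = 3, 3
box_regression = torch.arange(
    num_rois * num_classes * 4, dtype=torch.float32
).view(num_rois, num_classes * 4)

labels = torch.tensor([0, 1, 2])  # RoI 0 is a background sample
sampled_pos_inds_subset = torch.nonzero(labels > 0).squeeze(1)  # tensor([1, 2])
labels_pos = labels[sampled_pos_inds_subset]

# Per-class case: columns [4c, 4c+1, 4c+2, 4c+3] for ground-truth class c.
map_inds = 4 * labels_pos[:, None] + torch.tensor([0, 1, 2, 3])
picked = box_regression[sampled_pos_inds_subset[:, None], map_inds]
print(picked)
# tensor([[16., 17., 18., 19.],
#         [32., 33., 34., 35.]])
```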

You should modify the corresponding smooth_l1_loss so that, with reduction='none', it returns the per-RoI losses for all RoIs.
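
A sketch of what that modification might look like. This is not the stock maskrcnn-benchmark function (which, as far as I recall, takes a size_average flag rather than a reduction string); it is a rewritten variant where reduction='none' sums over the 4 box coordinates so each RoI gets exactly one loss value for the OHEM ranking:

```python
import torch

def smooth_l1_loss(input, target, beta=1.0, reduction="mean"):
    # Elementwise smooth-L1: quadratic below beta, linear above it.
    n = torch.abs(input - target)
    loss = torch.where(n < beta, 0.5 * n ** 2 / beta, n - 0.5 * beta)
    if reduction == "mean":
        return loss.mean()
    if reduction == "sum":
        return loss.sum()
    # "none": one loss per RoI, summed over the 4 box coordinates.
    return loss.sum(dim=1)

pred = torch.zeros(2, 4)
target = torch.tensor([[0.5, 0.0, 0.0, 0.0], [2.0, 0.0, 0.0, 0.0]])
print(smooth_l1_loss(pred, target, reduction="none"))
# tensor([0.1250, 1.5000])
```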

And then, in box_head.py, combine loss_classifier and loss_box_reg like this:

def ohem_for_bbox_loss(ohem_batch_size, loss_classifier, loss_box_reg):
    '''select the top-K highest-loss proposals'''
    ohem_batch_size = min(
        ohem_batch_size, loss_classifier.size()[0]
    )
    cls_box_loss = loss_classifier.detach() + loss_box_reg.detach()
    ohem_idxs = torch.topk(cls_box_loss, ohem_batch_size)[1]
    loss_classifier = loss_classifier[ohem_idxs]
    loss_box_reg = loss_box_reg[ohem_idxs]
    loss_classifier = loss_classifier.mean()
    loss_box_reg = loss_box_reg.mean()
    return loss_classifier, loss_box_reg
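
A quick standalone sanity check of that top-K selection, with hypothetical per-proposal loss values and K = 2: the two proposals with the largest combined loss should be the ones kept and averaged.

```python
import torch

loss_classifier = torch.tensor([0.1, 0.9, 0.2, 0.8])  # made-up per-RoI losses
loss_box_reg = torch.tensor([0.0, 0.5, 0.1, 0.3])
K = 2

# Same selection rule as ohem_for_bbox_loss above: rank by combined loss.
ohem_idxs = torch.topk(loss_classifier + loss_box_reg, K)[1]
print(sorted(ohem_idxs.tolist()))  # proposals 1 and 3 are kept
print(loss_classifier[ohem_idxs].mean().item())
print(loss_box_reg[ohem_idxs].mean().item())
```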

Hope it helps.
Good luck.

@zimenglan-sysu-512 (Contributor, Author)

zimenglan-sysu-512 commented Apr 3, 2019

related to #439 and #630

@xxradon

xxradon commented Apr 4, 2019

@zimenglan-sysu-512 Thank you very much, it's very helpful.

Lyears pushed a commit to Lyears/maskrcnn-benchmark that referenced this pull request Jun 28, 2020
* make pixel indexes 0-based for bounding box in pascal voc dataset

* replacing all instances of torch.distributed.deprecated with torch.distributed

* replacing all instances of torch.distributed.deprecated with torch.distributed

* add GroupNorm

* add GroupNorm -- sort out yaml files

* use torch.nn.GroupNorm instead, replace 'use_gn' with 'conv_block' and use 'BaseStem'&'Bottleneck' to simply codes

* modification on 'group_norm' and 'conv_with_kaiming_uniform' function

* modification on yaml files in configs/gn_baselines/ and reduce the amount of indentation and code duplication

* use 'kaiming_uniform' to initialize resnet, disable gn after fc layer, and add dilation into ResNetHead

* agnostic-regression for bbox