Faster GPU NMS operator (apache#16542)
* Adding second NMS op

* NMS kernel

* Removing second sort

* Optimization

* Adding out-of-place ability to SortByKey

* Optimization pt2

* Optimizations pt3

* Do not recompute other boxes area every time

* Sort only topk results during second sorting

* Cleaning

* Fixes from rebase

* Fix lint and more fixes from rebase

* Fix typo

* Early exit in Triangle kernel

* Fixes

* Fix sort

* Fix from rebase

* Fix for the mixed naming convention

* Fix the index_t with int comparison
ptrendx authored and yajiedesign committed Nov 6, 2019
1 parent 54faba0 commit 009a3a2
Showing 4 changed files with 840 additions and 38 deletions.
1 change: 1 addition & 0 deletions src/operator/contrib/bounding_box.cc
@@ -34,6 +34,7 @@ DMLC_REGISTER_PARAMETER(BoxOverlapParam);
DMLC_REGISTER_PARAMETER(BipartiteMatchingParam);
DMLC_REGISTER_PARAMETER(BoxDecodeParam);


NNVM_REGISTER_OP(_contrib_box_nms)
.add_alias("_contrib_box_non_maximum_suppression")
.describe(R"code(Apply non-maximum suppression to input.
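For readers unfamiliar with the operation this commit speeds up, the following is a minimal CPU sketch of greedy non-maximum suppression. It is not the GPU kernel added in this commit and does not reproduce the operator's actual parameters or output layout. The `Box` struct and the `BoxArea`, `IoU`, and `GreedyNms` names are hypothetical, chosen only for illustration, and corner-format boxes are assumed. Precomputing the areas once loosely mirrors the "Do not recompute other boxes area every time" item in the commit message.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <numeric>
#include <vector>

// Hypothetical corner-format box (x1, y1) to (x2, y2); not a type from the MXNet source.
struct Box {
  float x1, y1, x2, y2;
};

// Area of a box, clamped to zero for degenerate (inverted) boxes.
inline float BoxArea(const Box& b) {
  return std::max(0.0f, b.x2 - b.x1) * std::max(0.0f, b.y2 - b.y1);
}

// Intersection-over-union of two boxes, given their precomputed areas.
inline float IoU(const Box& a, const Box& b, float area_a, float area_b) {
  const float w = std::max(0.0f, std::min(a.x2, b.x2) - std::max(a.x1, b.x1));
  const float h = std::max(0.0f, std::min(a.y2, b.y2) - std::max(a.y1, b.y1));
  const float inter = w * h;
  const float uni = area_a + area_b - inter;
  return uni > 0.0f ? inter / uni : 0.0f;
}

// Greedy NMS: returns the indices of the kept boxes, highest score first.
// A box is dropped when its IoU with an already kept box exceeds overlap_thresh.
std::vector<std::size_t> GreedyNms(const std::vector<Box>& boxes,
                                   const std::vector<float>& scores,
                                   float overlap_thresh) {
  assert(boxes.size() == scores.size());
  const std::size_t n = boxes.size();

  // Sort indices by descending score (a single sort by key).
  std::vector<std::size_t> order(n);
  std::iota(order.begin(), order.end(), std::size_t{0});
  std::stable_sort(order.begin(), order.end(),
                   [&scores](std::size_t a, std::size_t b) {
                     return scores[a] > scores[b];
                   });

  // Compute every area once instead of inside the pairwise loop.
  std::vector<float> areas(n);
  for (std::size_t i = 0; i < n; ++i) {
    areas[i] = BoxArea(boxes[i]);
  }

  std::vector<bool> suppressed(n, false);
  std::vector<std::size_t> keep;
  for (std::size_t i = 0; i < n; ++i) {
    const std::size_t a = order[i];
    if (suppressed[a]) continue;
    keep.push_back(a);
    // Only lower-scored boxes (later in the order) can be suppressed by a.
    for (std::size_t j = i + 1; j < n; ++j) {
      const std::size_t b = order[j];
      if (suppressed[b]) continue;
      if (IoU(boxes[a], boxes[b], areas[a], areas[b]) > overlap_thresh) {
        suppressed[b] = true;
      }
    }
  }
  return keep;
}
```

The inner loop only visits pairs where the second box has a lower score than the first, which is the triangular access pattern that a GPU implementation can split across threads. This sketch runs it sequentially.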