This repository has been archived by the owner on Nov 17, 2023. It is now read-only.

Faster GPU NMS operator (#16542)
* Adding second NMS op

* NMS kernel

* Removing second sort

* Optimization

* Adding out-of-place ability to SortByKey

* Optimization pt2

* Optimizations pt3

* Do not recompute other boxes area every time

* Sort only topk results during second sorting

* Cleaning

* Fixes from rebase

* Fix lint and more fixes from rebase

* Fix typo

* Early exit in Triangle kernel

* Fixes

* Fix sort

* Fix from rebase

* Fix for the mixed naming convention

* Fix the index_t with int comparison
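
For readers unfamiliar with the operation being optimized, here is a minimal CPU-side sketch of greedy non-maximum suppression. It is not the GPU kernel added by this commit. It only illustrates two ideas named in the commit message: computing each box's area once rather than inside the pairwise loop, and restricting the work to the top-k scored boxes. The `Box` struct, the `GreedyNms` function, and the corner-format layout are hypothetical names chosen for this illustration.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <numeric>
#include <vector>

// Hypothetical box type for illustration: corner format plus a confidence score.
struct Box {
  float x1, y1, x2, y2;
  float score;
};

// Greedy NMS sketch. Returns the indices of the kept boxes, highest score first.
// A negative topk means "consider every box".
std::vector<std::size_t> GreedyNms(const std::vector<Box>& boxes,
                                   float iou_threshold, int topk = -1) {
  assert(iou_threshold >= 0.0f && iou_threshold <= 1.0f);
  const std::size_t n = boxes.size();

  // Sort indices by descending score, then keep only the top-k candidates.
  std::vector<std::size_t> order(n);
  std::iota(order.begin(), order.end(), std::size_t{0});
  std::stable_sort(order.begin(), order.end(),
                   [&boxes](std::size_t a, std::size_t b) {
                     return boxes[a].score > boxes[b].score;
                   });
  if (topk >= 0 && static_cast<std::size_t>(topk) < n) {
    order.resize(static_cast<std::size_t>(topk));
  }

  // Compute every area once instead of recomputing it for each pair.
  std::vector<float> area(n);
  for (std::size_t i = 0; i < n; ++i) {
    area[i] = std::max(0.0f, boxes[i].x2 - boxes[i].x1) *
              std::max(0.0f, boxes[i].y2 - boxes[i].y1);
  }

  std::vector<bool> suppressed(order.size(), false);
  std::vector<std::size_t> keep;
  for (std::size_t i = 0; i < order.size(); ++i) {
    if (suppressed[i]) continue;
    const Box& a = boxes[order[i]];
    keep.push_back(order[i]);
    // Suppress every lower-scored box that overlaps the kept box too much.
    for (std::size_t j = i + 1; j < order.size(); ++j) {
      if (suppressed[j]) continue;
      const Box& b = boxes[order[j]];
      const float w = std::max(0.0f, std::min(a.x2, b.x2) - std::max(a.x1, b.x1));
      const float h = std::max(0.0f, std::min(a.y2, b.y2) - std::max(a.y1, b.y1));
      const float inter = w * h;
      const float uni = area[order[i]] + area[order[j]] - inter;
      if (uni > 0.0f && inter / uni > iou_threshold) suppressed[j] = true;
    }
  }
  return keep;
}
```

Calling `GreedyNms(boxes, 0.5f)` on two heavily overlapping boxes and one distant box would keep the higher-scored box of the overlapping pair along with the distant one. A GPU version has to parallelize the pairwise overlap test, which this sequential sketch does not attempt.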
ptrendx authored and zhreshold committed Nov 5, 2019
1 parent c583e44 commit 0c5677e
Showing 4 changed files with 840 additions and 38 deletions.
1 change: 1 addition & 0 deletions src/operator/contrib/bounding_box.cc
@@ -34,6 +34,7 @@ DMLC_REGISTER_PARAMETER(BoxOverlapParam);
DMLC_REGISTER_PARAMETER(BipartiteMatchingParam);
DMLC_REGISTER_PARAMETER(BoxDecodeParam);


NNVM_REGISTER_OP(_contrib_box_nms)
.add_alias("_contrib_box_non_maximum_suppression")
.describe(R"code(Apply non-maximum suppression to input.