fix potential floating number overflow, enable float16 #12118

zhreshold · 2018-08-10T01:20:40Z

Description

contrib.box_nms and contrib.bipartite_matching use floating-point arrays to store indices, which can be extremely large.
To avoid overflow when float32/float16 is cast to integer, this PR forces the use of int32_t in index arrays. (The new limit is 2^31 - 1 instead of 2^k + 1, where k is the number of mantissa bits of the float type.)

Tests updated for various dtypes
no functionality change
no API change
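
The precision problem described above can be reproduced without MXNet. The following is a minimal sketch, not the code from this PR: it uses Python's standard `struct` module to store integers in IEEE 754 half and single precision and read them back. The helper names `roundtrip` and `exact_integer_limit` are made up for this illustration.

```python
import struct

INT32_MAX = 2**31 - 1  # largest index an int32_t can hold


def roundtrip(value, fmt):
    """Store an integer in a float of the given struct format and read it back."""
    return struct.unpack(fmt, struct.pack(fmt, float(value)))[0]


def exact_integer_limit(fmt):
    """Return the smallest power of two 2**k for which 2**k + 1 does not survive the round trip."""
    k = 1
    while roundtrip(2**k + 1, fmt) == 2**k + 1:
        k += 1
    return 2**k


# "<e" is IEEE 754 half precision (float16), "<f" is single precision (float32).
for name, fmt in [("float16", "<e"), ("float32", "<f")]:
    limit = exact_integer_limit(fmt)
    stored = int(roundtrip(limit + 1, fmt))
    print(f"{name}: index {limit + 1} is read back as {stored}")

print(f"int32_t: indices are exact up to {INT32_MAX}")
```

For binary floating-point formats, every non-negative integer up to the returned power of two is exactly representable, so it marks the point past which an index stored as a float can silently change value.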

Checklist

Essentials

Please feel free to remove inapplicable items for your PR.

Changes are complete (i.e. I finished coding on this PR)
All changes have test coverage:
Unit tests are added for small changes to verify correctness (e.g. adding a new operator)
Nightly tests are added for complicated/long-running ones (e.g. changing distributed kvstore)
Build tests will be added for build configuration changes (e.g. adding a new build option with NCCL)
Code is well-documented:
For user-facing API changes, API doc string has been updated.
For new C++ functions in header files, their functionalities and arguments are documented.
For new examples, README.md is added to explain what the example does, the source of the dataset, expected performance on the test set, and a reference to the original paper if applicable
Check the API doc at http://mxnet-ci-doc.s3-accelerate.dualstack.amazonaws.com/PR-$PR_ID/$BUILD_ID/index.html
To the best of my knowledge, examples are either not affected by this change, or have been fixed to be compatible with this change

zhreshold · 2018-08-10T01:21:39Z

@ijkguo

larroy · 2018-08-11T00:39:07Z

Why don't you use size_t instead of int32_t?

zhreshold · 2018-08-11T01:24:48Z

@larroy Negative numbers are required to mark some indices for special purposes.
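
To illustrate why an unsigned type does not fit here, the sketch below uses Python's `ctypes` to show what happens to a negative marker in a signed versus an unsigned C integer. The marker value `-1` and the names `INVALID` and `indices` are assumptions made for this example; the thread does not say which negative values the operators actually use.

```python
import ctypes

INVALID = -1  # hypothetical marker for an index that should be skipped

# A signed 32-bit integer keeps the marker as-is.
as_int32 = ctypes.c_int32(INVALID).value

# An unsigned size_t cannot hold a negative value: ctypes does no overflow
# checking, so -1 wraps around to the largest value the type can represent.
as_size_t = ctypes.c_size_t(INVALID).value

print("int32_t:", as_int32)
print("size_t: ", as_size_t)

# With signed indices, marked entries can be filtered with a simple sign test.
indices = [4, INVALID, 0, 7, INVALID]
kept = [i for i in indices if i >= 0]
print("kept:", kept)
```

With `size_t`, the wrapped value is indistinguishable from a very large valid index unless every consumer compares against that exact constant.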

larroy · 2018-08-11T14:28:48Z

ssize_t then? It’s not portable otherwise

larroy · 2018-08-14T13:10:10Z

@zhreshold did you see my previous comment? Thanks.

zhreshold · 2018-08-14T17:56:21Z

@larroy Sorry, I missed that. But I don't quite get your point. Isn't int32_t portable enough through the stdint standard? ssize_t is more problematic in terms of portability because it is not guaranteed to be at least 4 bytes.
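
The width argument can be checked on any given machine with a short sketch like the one below. It only reports what the local platform does: `int32_t` is 4 bytes by definition, while the size printed for `ssize_t` is a single data point, not a guarantee. The helper `signed_range` is a name invented for this example and assumes a two's complement representation.

```python
import ctypes


def signed_range(ctype):
    """Return (min, max) of a signed integer type, assuming two's complement."""
    bits = 8 * ctypes.sizeof(ctype)
    return -(2 ** (bits - 1)), 2 ** (bits - 1) - 1


int32_bytes = ctypes.sizeof(ctypes.c_int32)    # fixed-width type
ssize_bytes = ctypes.sizeof(ctypes.c_ssize_t)  # platform-dependent

print(f"int32_t: {int32_bytes} bytes, range {signed_range(ctypes.c_int32)}")
print(f"ssize_t: {ssize_bytes} bytes on this platform, range {signed_range(ctypes.c_ssize_t)}")
```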

larroy · 2018-08-14T22:28:25Z

http://pubs.opengroup.org/onlinepubs/7908799/xsh/limits.h.html
Ok, seems negative range is not well specified.

zhreshold · 2018-08-15T21:21:46Z

CI always fails at deploy stage. Do you have a clue? @larroy

larroy · 2018-08-15T21:30:00Z

Minor problem with doc generation. Have you rebased against master?

zhreshold · 2018-08-15T21:31:07Z

@larroy Yep, just rebased 2 hours ago

larroy · 2018-08-15T22:34:01Z

It's broken in master #11990

* fix potential floating number overflow, enable float16
* fix cuda impl
* fix cuda imple
* fix template substitution for windows
* half_f substantiate operand + fix
* remove ambiguous operand + for mshadow half_T
* fix con't
* use int32_t as indices
* use overload
* try remove ambiguous function overloading
* thrust version limit
* change sizeof cast from floor to ceil when allocating buffers
* cleaner
* fix alignment of pointers

zhreshold requested a review from anirudh2290 as a code owner August 10, 2018 01:20

zhreshold force-pushed the box-nms-fix branch from 8f77818 to 7930895 Compare August 14, 2018 02:15

zhreshold force-pushed the box-nms-fix branch from 87b2275 to 892b9ac Compare August 15, 2018 17:56

zhreshold added 14 commits August 16, 2018 11:31

fix potential floating number overflow, enable float16

30d85e2

fix cuda impl

9daf3f5

fix cuda imple

edb31e8

fix template substitution for windows

0ab0695

half_f substantiate operand + fix

0b90e5a

remove ambiguous operand + for mshadow half_T

6915172

fix con't

62a106e

use int32_t as indices

4c86438

use overload

565a9c1

try remove ambiguous function overloading

38eeece

thrust version limit

75c9240

change sizeof cast from floor to ceil when allocating buffers

c5688c1

cleaner

e72707c

fix alignment of pointers

e09ef1a

zhreshold force-pushed the box-nms-fix branch from 892b9ac to e09ef1a Compare August 16, 2018 18:31

piiswrong merged commit c479eb2 into apache:master Aug 20, 2018

zhreshold deleted the box-nms-fix branch August 20, 2018 21:12

zhreshold mentioned this pull request Aug 27, 2018

YOLOv3 evaluation get different mAP on COCO by different batch_size dmlc/gluon-cv#273

Closed

zhreshold mentioned this pull request Sep 17, 2018

eval_ssd speed drops with increasing batch size dmlc/gluon-cv#313

Closed
