Add fp16 and fp64 support for topk #14912

anirudhacharya · 2019-05-07T21:46:44Z

Description

Add fp16 and fp64 support for the topk operator. Required for certain machine translation tasks (and other NLP-related tasks).

Checklist

Essentials

Please feel free to remove inapplicable items for your PR.

Changes are complete (i.e. I finished coding on this PR)
All changes have test coverage:
Unit tests are added for small changes to verify correctness (e.g. adding a new operator)
Nightly tests are added for complicated/long-running ones (e.g. changing distributed kvstore)
Build tests will be added for build configuration changes (e.g. adding a new build option with NCCL)
Code is well-documented:
For user-facing API changes, API doc string has been updated.
For new C++ functions in header files, their functionalities and arguments are documented.
For new examples, README.md is added to explain what the example does, the source of the dataset, expected performance on the test set, and a reference to the original paper if applicable
Check the API doc at http://mxnet-ci-doc.s3-accelerate.dualstack.amazonaws.com/PR-$PR_ID/$BUILD_ID/index.html
To the best of my knowledge, examples are either not affected by this change, or have been fixed to be compatible with this change

Changes

add fp16 and fp64 support for topk operator
Fixes FP16 support for topK #14125, mx.nd.topk does not work with ndarray of type float16 #11156, and Fp16 support for top_k operator #12705

For review - @anirudh2290 @apeforest

vandanavk · 2019-05-08T16:45:56Z

@mxnet-label-bot add [pr-awaiting-review, Operator]

src/operator/tensor/ordering_op-inl.h

anirudhacharya · 2019-05-09T22:29:08Z

@marcoabreu @perdasilva any idea why the build fails only on Windows GPU and passes for the rest?

perdasilva · 2019-05-10T12:39:01Z

I've had a look and also reached out to Anton. I have no idea what the problem here could be. I wonder if it has something to do with the environment. On Windows we seem to be using CUDA v9.2 and driver v398.75. But I don't see how this could really be an issue...=S

anirudhacharya · 2019-05-10T16:46:20Z

Thank you, @perdasilva. @lebeg, can you please help out here?

sxjscience · 2019-05-15T06:42:33Z

src/operator/tensor/ordering_op-inl.h

@@ -401,8 +401,6 @@ void TopKImpl(const RunContext &ctx,
    mxnet::op::SortByKeyWorkspaceSize<int, int, xpu>(src.Size()));
  temp_size = std::max(temp_size,
    mxnet::op::SortByKeyWorkspaceSize<int, DType, xpu>(src.Size()));
-  temp_size = std::max(temp_size,
-    mxnet::op::SortByKeyWorkspaceSize<DType, int, xpu>(src.Size()));


This line should still be needed, because we need to call mxnet::op::SortByKey(dat, ind, is_ascend, &sort_work).

anirudhacharya · 2019-05-15

Can't it be done even now? This change passed all the unit tests, by the way.

sxjscience · 2019-05-20

Removing this line may break some corner cases that the existing tests do not cover. Since we need to call SortByKey<DType, int>(dat, ind, ...), SortByKey<int, DType>(batch_id, dat, ...) and SortByKey<int, int>(batch_id, ind, ...), we should make sure that the temporary storage is large enough for all three cases.

anirudhacharya · 2019-05-20

> may break some corner cases that the existing tests do not cover

It does not break any existing tests. I checked that locally, and the tests passed on the CI too.

> we should make sure that the temporary storage is large enough for all three cases.

I thought about this when making the change and tried to make sure of it. Is there a use case or unit test you could point to that would break with this change?
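The sizing concern in this thread can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: `SortWorkspaceBytes` is a hypothetical stand-in with a made-up cost model (keys double-buffered, values single-buffered), not MXNet's `SortByKeyWorkspaceSize`, and `std::int32_t` stands in for the index type. Under that assumed model, the required size depends on which type is the key, so the shared buffer is sized as the maximum over every key/value pairing the operator sorts with. Whether the real workspace sizes are asymmetric in the same way depends on the actual implementation.

```cpp
#include <algorithm>
#include <cstddef>
#include <cstdint>

// HYPOTHETICAL stand-in for a sort-by-key workspace query. The cost model
// (two buffers for keys, one for values) is an assumption for illustration
// only; it is NOT MXNet's SortByKeyWorkspaceSize.
template <typename Key, typename Value>
constexpr std::size_t SortWorkspaceBytes(std::size_t n) {
  return 2 * n * sizeof(Key) + n * sizeof(Value);
}

// Size one shared buffer so that each of the three sorts fits in it.
template <typename DType>
constexpr std::size_t TopKWorkspaceBytes(std::size_t n) {
  return std::max({SortWorkspaceBytes<std::int32_t, std::int32_t>(n),  // (batch_id, ind)
                   SortWorkspaceBytes<std::int32_t, DType>(n),         // (batch_id, dat)
                   SortWorkspaceBytes<DType, std::int32_t>(n)});       // (dat, ind)
}

// The same computation with the <DType, int> term left out.
template <typename DType>
constexpr std::size_t TopKWorkspaceBytesWithoutDataKey(std::size_t n) {
  return std::max(SortWorkspaceBytes<std::int32_t, std::int32_t>(n),
                  SortWorkspaceBytes<std::int32_t, DType>(n));
}
```

Under this assumed model, a 4-byte `float` gives the same result with or without the `<DType, int>` term, while an 8-byte `double` needs a larger buffer when the term is included. That is one way a change could pass tests on narrower types and still undersize the buffer for a wider one.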

abhinavs95 · 2019-05-31T21:59:12Z

@anirudh2290 @sxjscience Could you see if your comments are addressed? Thanks.

piyushghai · 2019-06-07T22:32:43Z

@anirudh2290 Bouncing for a review...

vandanavk · 2019-06-16T19:13:02Z

Is this PR good to go?

anirudhacharya changed the title ~~Add fp16 and fp64 support for for topk~~ Add fp16 and fp64 support for topk May 7, 2019

anirudhacharya force-pushed the fp16 branch from 18c35a8 to 37e7f8c May 7, 2019 23:29

marcoabreu added the Operator and pr-awaiting-review labels May 8, 2019

anirudhacharya force-pushed the fp16 branch from 37e7f8c to 5cefe60 May 9, 2019 19:53

anirudh2290 reviewed May 9, 2019

src/operator/tensor/ordering_op-inl.h (outdated)

anirudhacharya force-pushed the fp16 branch from 5cefe60 to f801232 May 9, 2019 21:10

sxjscience reviewed May 15, 2019

anirudhacharya force-pushed the fp16 branch from f801232 to fbeee21 July 1, 2019 16:27

anirudhacharya requested review from eric-haibin-lin, gigasquid, nswamy, sergeykolychev, szha and yzhliu as code owners July 16, 2019 21:44

fp16 for topk

ebebebd

anirudhacharya force-pushed the fp16 branch from 2eb1e80 to ebebebd July 16, 2019 22:15

anirudhacharya closed this Jul 16, 2019

anirudhacharya deleted the fp16 branch July 16, 2019 22:16

anirudhacharya restored the fp16 branch July 16, 2019 22:16

anirudhacharya mentioned this pull request Jul 16, 2019

Add fp16 support for topk #15560 (Merged)
