Change to use MXNet's topk for CPUs in inference #506

ciyongch · 2018-08-13T08:47:21Z

Since MXNet's topk has better performance than numpy version with
PR apache/mxnet#12085, in order
to leverage such performance boost, change to use MXNet's topk for
CPU device when doing inference.

(description of the change)

Pull Request Checklist

Changes are complete (if posting work-in-progress code, prefix your pull request title with '[WIP]'
until you can check this box.
Unit tests pass (pytest)
Were system tests modified? If so did you run these at least 5 times to account for the variation across runs?
System tests pass (pytest test/system)
Passed code style checking (./style-check.sh)
You have considered writing a test
Updated major/minor version in sockeye/__init__.py. Major version bump if this is a backwards incompatible change.
Updated CHANGELOG.md

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.

Since MXNet's topk has better performance than numpy version with PR apache/mxnet#12085, in order to leverage such performance boost, change to use MXNet's topk for CPU device when doing inference.

fhieber · 2018-08-13T08:55:13Z

We are already preparing Sockeye for upcoming MXNet 1.3 in the 'blocks' branch, which already switches to mxnet topk for both GPU and CPU: https://github.com/awslabs/sockeye/blob/blocks/sockeye/inference.py

I think, as long as mxnet==1.3 is not released, we should stay with the current best option (Sockeye/master always depends on released mxnet versions, not mxnet/master).

pengzhao-intel · 2018-08-13T09:45:01Z

@fhieber thanks for the information and it's fine to wait for the release 1.3.
Do you have a chance to try the performance with the new topk?
Feel free to let us know if any performance issue is found in your case.

ciyongch · 2018-08-14T00:38:47Z

@fhieber @pengzhao-intel Since there's already another plan to enable new MXNet topk, I would like to close this PR.

xinyu-intel · 2018-09-12T01:00:38Z

@fhieber Since MXNet 1.3.0 has been released, do you have any plan to switch to use MXNet's topk for
CPU device when doing inference?

fhieber · 2018-09-12T06:48:03Z

sure, I'll work on getting the 'blocks' branch up-to-date and merged into master while updating the requirements; probably this or next week.

Change to use MXNet's topk for CPUs in inference

3cd1d01

Since MXNet's topk has better performance than numpy version with PR apache/mxnet#12085, in order to leverage such performance boost, change to use MXNet's topk for CPU device when doing inference.

ciyongch requested review from davvil, fhieber, mjdenkowski and tdomhan as code owners August 13, 2018 08:47

ciyongch mentioned this pull request Aug 13, 2018

[Operator] Accelerate the CPU side performance of topk apache/mxnet#10205

Closed

ciyongch closed this Aug 14, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Change to use MXNet's topk for CPUs in inference #506

Change to use MXNet's topk for CPUs in inference #506

ciyongch commented Aug 13, 2018

fhieber commented Aug 13, 2018

pengzhao-intel commented Aug 13, 2018

ciyongch commented Aug 14, 2018

xinyu-intel commented Sep 12, 2018

fhieber commented Sep 12, 2018

Change to use MXNet's topk for CPUs in inference #506

Change to use MXNet's topk for CPUs in inference #506

Conversation

ciyongch commented Aug 13, 2018

Pull Request Checklist

fhieber commented Aug 13, 2018

pengzhao-intel commented Aug 13, 2018

ciyongch commented Aug 14, 2018

xinyu-intel commented Sep 12, 2018

fhieber commented Sep 12, 2018