Support Mod in elementwise system #33052

JamesLim-sy · 2021-05-21T17:36:27Z

PR types

Performance optimization

PR changes

OPs

Describe

Basing on new elementwise + broadcast system support binary functors below : Mod
The performance variation is recorded in the statics table below:
As can be seen from the table, in most cases, the mod operation costs less CUDA operation time after adopting new elementwise + broadcast system. However, the old broadcast branch work better in the 2nd test case, the old broadcast branch consists of perf-optimized branch and common broadcast branch, the former one works well when the quantity of input tensor data is not big enough and the input tensor`s dim meet the special demands. Apparently, 2nd test case perfectly meets the that demands and data quantity is relatively small, therefore, it beats the new elementwise + broadcast op in 2nd case. But introducing of this branch dose make the code less compactness and hard to maintain, and working area of this is not big enough. Furthermore, the elementwise op in paddle is suggested to dealing with NN whose data quantity is often large, so i suggest Approval of 2nd test case and adopt the new elementwise + broadcast system in mod op.

paddle-bot-old · 2021-05-21T17:36:30Z

Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

…time

paddle-bot-old · 2021-06-13T02:35:05Z

Sorry to inform you that 8b06d3c's CIs have passed for more than 7 days. To prevent PR conflicts, you need to re-run all CIs manually.

Xreki

2021-06-22 20:01:35 [check_op_benchmark_result.py:80] [INFO] ------ OP: remainder_2 (forward) ------
2021-06-22 20:01:35 [check_op_benchmark_result.py:82] [INFO] GPU time change: 7.25394% (develop: 0.0223620 -> PR: 0.0239841)
2021-06-22 20:01:35 [check_op_benchmark_result.py:84] [INFO] Total time change: 1.92571% (develop: 0.0402996 -> PR: 0.0410757)
2021-06-22 20:01:35 [check_op_benchmark_result.py:85] [INFO] backward: False
2021-06-22 20:01:35 [check_op_benchmark_result.py:86] [INFO] parameters:
2021-06-22 20:01:35 [check_op_benchmark_result.py:88] [INFO] x (Variable) - dtype: float32, shape: [16, 2048, 7, 7]
2021-06-22 20:01:35 [check_op_benchmark_result.py:88] [INFO] y (Variable) - dtype: float32, shape: [16, 2048]
2021-06-22 20:01:35 [check_op_benchmark_result.py:88] [INFO] axis (int): 0
2021-06-22 20:01:35 [check_op_benchmark_result.py:153] [ERROR] Check speed result with case "remainder_2 (forward)" failed.

CI中该配置性能下降7%，但其他配置均有性能提升，故可以先合入该PR。

First_Commit.

4015a0e

JamesLim-sy added 8 commits May 24, 2021 14:25

adjust the elementwise-functor location

8ef5850

Fisrt commit

9d46543

Trigger of rerun

74e4179

To avoid spartial specification bugs which happened in PR-CI-ROCM

656ac99

Avoid kUnary instantiation of LaunchElementwiseCudaKernel at compile …

585566f

…time

refine warpper of broadcast and add cuda op

d9c70ec

merge conflict

5c65ab0

merge broadcast changes

078d6a6

JamesLim-sy changed the title ~~Support Mod binary functors in elementwise system~~ Support Mod functors in elementwise system Jun 1, 2021

JamesLim-sy changed the title ~~Support Mod functors in elementwise system~~ Support Mod in elementwise system Jun 1, 2021

JamesLim-sy added 2 commits June 5, 2021 08:36

fix conflicts

71c8fc0

Merge branch 'develop' into Adding_mod_div_binary_functor_support

8b06d3c

JamesLim-sy force-pushed the Adding_mod_div_binary_functor_support branch from a58de7f to 8b06d3c Compare June 5, 2021 09:03

ReCommit for 7 days limitation

aeb7744

Xreki approved these changes Jun 23, 2021

View reviewed changes

Xreki merged commit 1017180 into PaddlePaddle:develop Jun 23, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support Mod in elementwise system #33052

Support Mod in elementwise system #33052

JamesLim-sy commented May 21, 2021 •

edited

Loading

paddle-bot-old bot commented May 21, 2021

paddle-bot-old bot commented Jun 13, 2021

Xreki left a comment

Support Mod in elementwise system #33052

Support Mod in elementwise system #33052

Conversation

JamesLim-sy commented May 21, 2021 • edited Loading

PR types

PR changes

Describe

paddle-bot-old bot commented May 21, 2021

paddle-bot-old bot commented Jun 13, 2021

Xreki left a comment

Choose a reason for hiding this comment

JamesLim-sy commented May 21, 2021 •

edited

Loading