
Fix attn_bias_add bug. #37147

Merged · 6 commits · Nov 16, 2021

Conversation

@limin2021 (Contributor) commented Nov 11, 2021

PR types

Bug fixes

PR changes

OPs

Describe

Problem description:
The fused_attention_op implementation uses bias_add, which was built on the kernel primitives. Later, the kernel primitives' WriteData API and its internals changed: the out-of-bounds check was moved into a template parameter. The call site then took the wrong branch, performing out-of-bounds writes that corrupted other GPU memory. Symptom: a single run of test_fused_attention_op_api.py almost never fails, but looping over inputs of different shapes yields intermittently incorrect results, so the bug is easy to miss.

Fix:
At the call site, change
kernel_primitives::WriteData<OutT, VecSize, 1, 1>(out + fix, result, num);
to
kernel_primitives::WriteData<OutT, VecSize, 1, 1, true>(out + fix, result, num);
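For context, here is a minimal sketch, assuming a simplified version of the kernel primitive (Paddle's real WriteData takes <T, NX, NY, BlockSize, IsBoundary>), of how a trailing boolean template parameter selects the bounds-checked branch at compile time. Leaving the flag at its default of false on the last, partial tile is exactly the out-of-bounds write described above:

```cpp
// Minimal sketch, NOT Paddle's actual kernel_primitives code: the last
// template parameter selects the bounds-checked path at compile time.
template <typename OutT, int VecSize, bool IsBoundary = false>
__device__ void WriteDataSketch(OutT* dst, const OutT* src, int num) {
  if (IsBoundary) {
    // Guarded path for the final partial tile: write only `num` elements.
#pragma unroll
    for (int i = 0; i < VecSize; ++i) {
      if (i < num) dst[i] = src[i];
    }
  } else {
    // Fast path: assumes a full tile of VecSize elements. Taking this
    // branch on a partial tile writes VecSize - num elements past the
    // end of `dst`, corrupting whatever lives there.
#pragma unroll
    for (int i = 0; i < VecSize; ++i) {
      dst[i] = src[i];
    }
  }
}
```

Before the change the check lived inside the function body, so the old call compiled unchanged against the new API but silently lost its guard; passing true restores it.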

To prevent this class of bug from recurring, the bias_add and reduce paths in fused_attention_op are changed to call the outermost interfaces of existing Paddle ops directly, as sketched below.
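For illustration, a rough sketch of what dispatching through the existing elementwise interface looks like. The launcher name comes from this PR's review thread ("LGTM for LaunchElementwiseCudaKernel"); the exact signature, functor, and surrounding types are assumptions based on the 2021-era codebase, not verbatim from this PR:

```cpp
// Hedged sketch: route bias add through the existing broadcast elementwise
// launcher instead of a hand-written kernel, so boundary handling stays in
// one maintained place. Types and signature are approximations (assumption).
std::vector<const framework::Tensor*> ins = {&input, &bias};  // bias broadcasts
std::vector<framework::Tensor*> outs = {&output};
LaunchElementwiseCudaKernel<ElementwiseType::kBinary, T, T>(
    dev_ctx, ins, &outs, /*axis=*/-1, AddFunctor<T>());
```

The design point is less this particular call than not duplicating kernel-level boundary logic: the fused op now inherits any future fixes to the shared elementwise path for free.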

2. Updated the English docs for the fused_attention and fused_feedforward op API arguments (revised per chenlong's review on #36972).
Preview (only the changed parts shown):

functional.fused_multi_head_attention:
[screenshots of the updated docstring preview]

nn.FusedMultiHeadAttention:
[screenshot of the updated docstring preview]

@paddle-bot-old
Thanks for your contribution!
Please wait for the result of CI first. See the Paddle CI Manual for details.

xingfeng01 previously approved these changes Nov 12, 2021

@xingfeng01 (Contributor) left a comment:


Does this change affect performance?

@@ -11,10 +11,14 @@ limitations under the License. */

#pragma once

#include "paddle/fluid/operators/fused/attn_bias_add.cu.h"
// #include "paddle/fluid/operators/fused/attn_bias_add.cu.h"
Contributor:

Suggest removing the unneeded lines in the next PR.

Contributor Author:

Done.

Contributor Author:

The previously hand-written bias_add mimicked the existing op's implementation, so restoring the code to call the existing op directly has no performance impact.

  }
}

void ComputeBackward(const T* input, const T* weight, const T* d_output,
                     T* d_input, T* d_weight, T* d_bias) {
  // void ComputeBackward(const T* input, const T* weight, const T* d_output,
Contributor:

Suggest removing the unneeded lines in the next PR.

@limin2021 (Contributor Author) Nov 12, 2021:

Done.

@AnnaTrainingG (Contributor):

LGTM for LaunchElementwiseCudaKernel

xingfeng01 previously approved these changes Nov 13, 2021
@lanxianghit merged commit a9e7a85 into PaddlePaddle:develop on Nov 16, 2021
zkh2016 pushed a commit to zkh2016/Paddle that referenced this pull request Nov 16, 2021