add finetune fused & add mc2 #8139
Conversation
Thanks for your contribution!
Should the MC2 part go into a separate file of its own?
These two MC2 ops are part of sequence parallel (sp); leave them where they are for now?
@@ -228,6 +228,11 @@ def scaled_dot_product_attention(
    alibi = alibi.reshape([bsz, num_heads, 1, -1])
    attention_mask = attention_mask.cast(alibi.dtype) + alibi
if get_env_device() == "npu":
    if attention_mask is not None:
        attention_mask = attention_mask.astype("bool")
Here, consider checking attn_mask once on the outside and only casting when it is not already bool; that improves performance. Rather than casting inside flash_attn, could you check and cast once before the mask is passed into the Transformer?
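Roughly something like this, as a sketch only (the helper name and call site are illustrative, not the code in this PR):

import paddle

def prepare_npu_attention_mask(attention_mask):
    # Cast to bool once, and only when the mask is not already bool, so the
    # per-layer flash-attention path does not repeat the cast.
    if attention_mask is not None and attention_mask.dtype != paddle.bool:
        attention_mask = attention_mask.astype("bool")
    return attention_mask

# Hypothetical call site, before the hidden states enter the decoder layers:
# attention_mask = prepare_npu_attention_mask(attention_mask)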
Resolved.
@@ -239,6 +244,7 @@ def scaled_dot_product_attention(
    attention_mask is None,
    True,
    False,
    is_casual
Suggest not trying to detect causal here; it is easy to get wrong for SFT/LoRA. Just pass False directly, unless you can guarantee the causal detection is correct.
Fixed.
This needs to be changed.
llm/finetune_generation.py (outdated)

@dataclass
@add_start_docstrings(ModelArgument.__doc__)
class SFTModelArguments(ModelArgument):
Move the whole block into arguments.py, and mind pylint.
@@ -240,7 +240,7 @@ def scaled_dot_product_attention(
    attention_mask is None,
    True,
    False,
    False,
    False
Can is_causal_mask be fixed?
if is_casual and alibi is None:
    attention_mask = None
if get_env_device != "npu":
    is_casual = is_casual_mask(attention_mask)
If this part is correct, you can set a variable here and pass it down into the FA (flash-attention) inputs.
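As a sketch of that idea, computing the flag once at the call site and threading it through (the keyword form below is for clarity only; the actual signature in this PR may differ):

is_casual = False
if alibi is None:
    # is_casual_mask returns True only for a plain lower-triangular (causal) mask
    is_casual = is_casual_mask(attention_mask)
    if is_casual:
        attention_mask = None  # the kernel can apply causal masking itself

attn_output = scaled_dot_product_attention(
    query_states,
    key_states,
    value_states,
    attention_mask=attention_mask,
    is_casual=is_casual,  # propagated explicitly instead of re-derived inside
)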
@@ -0,0 +1,550 @@
# Copyright (c) 2022 PaddlePaddle Authors. All Rights Reserved.
Why is a new file being added?
Codecov Report
Attention: Patch coverage is …
Additional details and impacted files

@@            Coverage Diff             @@
##           develop    #8139      +/-   ##
===========================================
- Coverage    55.15%   55.10%    -0.05%
===========================================
  Files          601      602        +1
  Lines        91764    91850       +86
===========================================
+ Hits         50611    50613        +2
- Misses       41153    41237       +84

☔ View full report in Codecov by Sentry.
import os

import paddle
import paddle_custom_device
This isn't installed by default, is it?
This file is only imported when running MC2 on NPU.
I know, but if this gets imported from anywhere else, it will raise an error.
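One way to keep the import from breaking other call sites, sketched under the assumption that the NPU wheel may simply be absent (the guard variable name is hypothetical, not part of this PR):

import importlib.util

# Import the NPU wheel only when it is actually installed, so importing this
# module from a non-NPU environment does not raise ImportError.
_HAS_NPU_CUSTOM_DEVICE = importlib.util.find_spec("paddle_custom_device") is not None

if _HAS_NPU_CUSTOM_DEVICE:
    import paddle_custom_device  # noqa: F401  # only needed for MC2 on NPU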
    ScatterOp,
    mark_as_sequence_parallel_parameter,
)

if int(os.getenv("MC2", 0)):
Could this check be a bit more elaborate?
FLAGS_NPU_MC2
Check the device first, then check the FLAGS.
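Something along these lines, as a sketch (get_env_device is the helper already used in this diff, FLAGS_NPU_MC2 is the flag name suggested above, and the wiring itself is an assumption):

import os

def mc2_enabled():
    # get_env_device() is the device helper used elsewhere in this diff; its
    # import path is assumed here. FLAGS_NPU_MC2 is the proposed flag name.
    if get_env_device() != "npu":
        return False
    return bool(int(os.getenv("FLAGS_NPU_MC2", "0")))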
Force-pushed from a356945 to 768f465 (Compare)
@@ -102,6 +106,64 @@ class ModelArgument:
    default=None, metadata={"help": "Build-in pretrained model name or the path to local model."}
)
use_flash_attention: bool = field(default=False, metadata={"help": "Whether to use flash attention"})
tokenizer_name_or_path: Optional[str] = field(
As a follow-up, it would be good if run_pretrain.py could also pull its options directly from arguments.py.
No description provided.