add long sequence strategies #8076
Conversation
Thanks for your contribution!
Codecov Report
Attention: Patch coverage is ...

Additional details and impacted files

@@            Coverage Diff             @@
##           develop    #8076      +/-   ##
===========================================
- Coverage    56.56%   55.41%    -1.16%
===========================================
  Files          589      600      +11
  Lines        89964    91642    +1678
===========================================
- Hits         50889    50782     -107
- Misses       39075    40860    +1785

☔ View full report in Codecov by Sentry.
llm/finetune_generation.py
Outdated
@@ -152,7 +152,7 @@ def main():
    )
    if hasattr(model_config, "use_flash_attention"):
        model_config.use_flash_attention = model_args.use_flash_attention
This file shouldn't need to be modified, should it?
llm/llama/sft_argument.json
Outdated
"zero_padding": false, | ||
"use_flash_attention": false | ||
} |
This JSON doesn't need to be modified either.
class AttentionWithLinearBias(nn.Layer):
    """
    init_args: bool_attention_mask, num_heads, dtype, tensor_parallel_degree
                + self._get_interleave(2 * closest_power_of_2)[0::2][: n - closest_power_of_2]
            )
    def forward(self, bool_attention_mask: Tensor, num_heads: int, dtype: paddle.dtype, tensor_parallel_degree=1):
What is the purpose of passing in tensor_parallel_degree?
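One plausible reason for passing tensor_parallel_degree (an assumption, not confirmed in this thread): under tensor parallelism the attention heads are sharded across ranks, so each rank only needs the ALiBi bias rows for its local heads. A minimal sketch of that slicing; the helper name local_alibi_slopes and the explicit rank argument are hypothetical:

def local_alibi_slopes(slopes, tensor_parallel_degree=1, rank=0):
    # With tensor parallelism, each rank owns len(slopes) // tensor_parallel_degree
    # attention heads, so it only needs that contiguous slice of the per-head slopes.
    per_rank = len(slopes) // tensor_parallel_degree
    return slopes[rank * per_rank : (rank + 1) * per_rank]

# Example: the ALiBi slopes for 8 heads (start = ratio = 0.5) split over 4 ranks.
slopes = [2 ** -(i + 1) for i in range(8)]
print(local_alibi_slopes(slopes, tensor_parallel_degree=4, rank=1))  # [0.125, 0.0625]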
    def _get_interleave(self, n):
        def _get_interleave_power_of_2(n):
            start = 2 ** (-(2 ** -(math.log2(n) - 3)))
            ratio = start
Aren't ratio and start equal here? Can the variable be reused?
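To make the point concrete: start and ratio hold the same value, so the geometric sequence can be written with a single variable. An illustrative standalone rewrite, not the PR's final code:

import math

def _get_interleave_power_of_2(n):
    # ALiBi slopes for a power-of-two head count form a geometric sequence
    # whose first term and common ratio are both `start`, so slope i is
    # simply start ** (i + 1) and the separate `ratio` variable can be dropped.
    start = 2 ** (-(2 ** -(math.log2(n) - 3)))
    return [start ** (i + 1) for i in range(n)]

print(_get_interleave_power_of_2(8))  # [0.5, 0.25, 0.125, ..., 0.00390625]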
""" | ||
try: | ||
import_class = importlib.import_module(f"paddlenlp.transformers.LongSequenceStrategies.{strategy_type}") | ||
except ValueError: |
Shouldn't this be ModuleNotFoundError?
            strategy_class = getattr(import_class, stratety_name)
            strategy_instance = strategy_class(**init_args)
            return strategy_instance
        except AttributeError:
If strategy_class is what fails to resolve, is the error raised really an AttributeError?
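Both review points in one place: importlib.import_module raises ModuleNotFoundError (a subclass of ImportError) when the module is missing, and getattr raises AttributeError only when a module that did import lacks the requested class. A sketch of the loading pattern with the two failures caught separately; the lowercase module path and the error messages are assumptions, not the PR's actual code:

import importlib

def load_strategy(strategy_type, strategy_name, **init_args):
    # A missing strategy module raises ModuleNotFoundError, not ValueError.
    try:
        module = importlib.import_module(
            f"paddlenlp.transformers.long_sequence_strategies.{strategy_type}"
        )
    except ModuleNotFoundError as e:
        raise ValueError(f"Unknown strategy type: {strategy_type}") from e

    # A module that imports but lacks the requested class raises AttributeError
    # here, so the getattr failure is the one that maps to a bad strategy name.
    try:
        strategy_class = getattr(module, strategy_name)
    except AttributeError as e:
        raise ValueError(f"Unknown strategy name: {strategy_name}") from e

    return strategy_class(**init_args)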
@@ -0,0 +1,49 @@ | |||
# Copyright (c) 2024 PaddlePaddle Authors. All Rights Reserved. |
File names and directory names should be lowercase.
PR types
PR changes
Models, APIs
Description
Decouple the long sequence strategies from the models.
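To illustrate the decoupling idea (a minimal sketch; the class name and the zero bias are placeholders, not the PR's API): the model holds only a strategy object and calls it to obtain the attention bias, so swapping ALiBi for another long-sequence scheme does not touch the model code.

import paddle
from paddle import nn

class ZeroBiasStrategy(nn.Layer):
    """Illustrative stand-in for a pluggable long-sequence strategy."""

    def forward(self, bool_attention_mask, num_heads, dtype, tensor_parallel_degree=1):
        batch_size = bool_attention_mask.shape[0]
        seq_len = bool_attention_mask.shape[-1]
        # A real strategy (e.g. AttentionWithLinearBias above) would compute
        # ALiBi slopes here; a zero bias keeps the sketch minimal.
        return paddle.zeros([batch_size, num_heads, 1, seq_len], dtype=dtype)

# The attention layer only sees the strategy interface, not its implementation.
strategy = ZeroBiasStrategy()
mask = paddle.ones([2, 1, 1, 128], dtype="bool")
bias = strategy(mask, num_heads=32, dtype="float32")  # shape [2, 32, 1, 128]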