Support Sharding Overlap #8473
Conversation
Thanks for your contribution!
if self.config.use_flash_attention and get_env_device() != "gcu":
    is_casual = is_casual_mask(attention_mask)
Don't delete this. if hasattr(self.config, "casual_mask")
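A minimal sketch of the guard the reviewer is asking to keep: the causal-mask fast path should only be taken when the config actually defines the flag, since older configs may lack the attribute entirely. `SimpleNamespace` stands in for the real model config here; only `use_flash_attention` and `casual_mask` come from the diff.

```python
from types import SimpleNamespace


def should_use_casual_mask(config):
    # Take the causal fast path only when the config defines the flag;
    # older configs may not have the `casual_mask` attribute at all.
    return bool(
        getattr(config, "use_flash_attention", False)
        and hasattr(config, "casual_mask")
        and config.casual_mask
    )


old_cfg = SimpleNamespace(use_flash_attention=True)  # no casual_mask attr
new_cfg = SimpleNamespace(use_flash_attention=True, casual_mask=True)
```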
paddlenlp/trainer/trainer.py
Outdated
@@ -1908,6 +1907,13 @@ def get_expected_keys(inputs, keys):
                self.optimizer = mix_precision_utils.MixPrecisionOptimizer(self.optimizer)
            self.optimizer = fleet.distributed_optimizer(self.optimizer)

            if in_sharding_parallel_mode:
                sharding_parallel_config = set(self.args.sharding_parallel_config.split(" "))
Handle this in the training_args file instead; don't split the string here.
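A sketch of what the reviewer is suggesting: normalize `sharding_parallel_config` once in the arguments dataclass, so the trainer consumes a set directly instead of re-splitting the raw string. The dataclass below is a simplified stand-in for the real `TrainingArguments`; only the field name comes from the diff.

```python
from dataclasses import dataclass, field


@dataclass
class TrainingArgumentsSketch:
    # Space-separated flags, e.g. "split_param enable_stage1_overlap".
    sharding_parallel_config: str = ""

    def __post_init__(self):
        # Split once, up front; downstream code then does set membership
        # checks instead of substring matching on the raw string.
        self.sharding_parallel_config = set(self.sharding_parallel_config.split())


args = TrainingArgumentsSketch(
    sharding_parallel_config="split_param enable_stage1_overlap"
)
```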
llm/run_pretrain.py
Outdated
@@ -223,6 +223,10 @@ class ModelArguments:
        default=None,
        metadata={"help": "num_hidden_layers."},
    )
    casual_mask: Optional[bool] = field(
Suggested change:
- casual_mask: Optional[bool] = field(
+ use_casual_mask: Optional[bool] = field(
Codecov Report
Attention: Patch coverage is
Additional details and impacted files

@@           Coverage Diff            @@
##           develop    #8473   +/-   ##
===========================================
- Coverage    54.29%   54.25%   -0.05%
===========================================
  Files          617      617
  Lines        96339    96368      +29
===========================================
- Hits         52312    52288      -24
- Misses       44027    44080      +53

☔ View full report in Codecov by Sentry.
paddlenlp/trainer/trainer.py
Outdated
@@ -1908,6 +1907,12 @@ def get_expected_keys(inputs, keys):
                self.optimizer = mix_precision_utils.MixPrecisionOptimizer(self.optimizer)
            self.optimizer = fleet.distributed_optimizer(self.optimizer)

            if in_sharding_parallel_mode:
                if "split_param" in self.args.sharding_parallel_config:
                    self.optimizer._set_all_gather_overlap_forward(True, model)
Does this interface need to account for version compatibility?
LGTM
LGTM
LGTM
This reverts commit 7aaa788.
PR types
Performance optimization
PR changes
Models
Description
1. Support sharding overlap