[LLM] Unify pipeline model with PretrainModelPipe #7095

ZHUI · 2023-09-20T09:46:06Z

PR types

New features

PR changes

Others

Description

Unify pipeline model with PretrainModelPipe

import paddle
import paddle.distributed.fleet as fleet
from paddlenlp.transformers import AutoModelForCausalLMPipe, AutoModelForCausalLM

world_size = paddle.distributed.get_world_size()

pp_degree = 2
tp_degree = world_size/2

strategy = fleet.DistributedStrategy()
strategy.hybrid_configs = {
    "dp_degree": 1,
    "mp_degree": tp_degree,
    "pp_degree": pp_degree,
    "sharding_degree": 1,
}
fleet.init(is_collective=True, strategy=strategy)
hcg = fleet.get_hybrid_communicate_group()

if pp_degree > 1:
    model_class = AutoModelForCausalLMPipe
else:
    model_class = AutoModelForCausalLM

model_name_or_path = "facebook/llama-7b"
# model_name_or_path = "__internal_testing__/tiny-random-llama"
model = model_class.from_pretrained(
    model_name_or_path,
    tensor_parallel_degree=tp_degree,
    tensor_parallel_rank=hcg.get_model_parallel_rank(),
    tensor_parallel_output=False,
)

model.eval()

paddle-bot · 2023-09-20T09:46:10Z

Thanks for your contribution!

codecov · 2023-09-20T10:24:15Z

Codecov Report

Merging #7095 (d7015d6) into develop (d2524ab) will decrease coverage by 0.29%.
Report is 8 commits behind head on develop.
The diff coverage is 34.58%.

@@             Coverage Diff             @@
##           develop    #7095      +/-   ##
===========================================
- Coverage    59.84%   59.55%   -0.29%     
===========================================
  Files          557      563       +6     
  Lines        82150    82775     +625     
===========================================
+ Hits         49161    49299     +138     
- Misses       32989    33476     +487

Files Changed	Coverage Δ
paddlenlp/transformers/model_utils.py	`64.03% <19.51%> (-4.08%)`	⬇️
paddlenlp/transformers/auto/modeling.py	`77.94% <25.00%> (-2.63%)`	⬇️
paddlenlp/transformers/llama/modeling_pp.py	`24.48% <70.00%> (ø)`
paddlenlp/transformers/__init__.py	`100.00% <100.00%> (ø)`
paddlenlp/transformers/gpt/__init__.py	`100.00% <100.00%> (ø)`
paddlenlp/transformers/gpt/configuration.py	`100.00% <100.00%> (ø)`
paddlenlp/transformers/gpt/modeling_pp.py	`31.39% <100.00%> (ø)`
paddlenlp/transformers/llama/__init__.py	`100.00% <100.00%> (ø)`
paddlenlp/transformers/llama/configuration.py	`100.00% <100.00%> (ø)`

... and 7 files with indirect coverage changes

wawltor · 2023-09-21T12:25:04Z

llm/gpt-3/run_pretrain.py

+    if not model_args.continue_training:
+        config.max_position_embeddings = max(config.max_position_embeddings, data_args.max_seq_length)
+
+    if not model_args.continue_training:


这里的vocab size的改动目的是什么

同时改动vocab size之后会对后续热启word embedding有影响吗

适配TP，随机初始化的才改

llm/gpt-3/run_pretrain.py

wawltor · 2023-09-21T12:53:52Z

paddlenlp/transformers/gpt/modeling_pp.py

@@ -268,7 +173,7 @@ def _logits_helper(embedding, output):
                shared_weight_attr="embedding_weight",
                config=config,
            ),
-            "gpt",
+            "gpt.embeddings",


影响精度

paddlenlp/transformers/model_utils.py

wawltor

LGTM

unify pipeline model with PretrainModelPipe.

3003d77

ZHUI requested review from DesmonDay and wawltor September 20, 2023 09:49

ZHUI added 5 commits September 21, 2023 11:51

unify gpt-3 modeling_pp.

9045c1b

Fix import GPTForCausalLMPipe

95cf714

Add comments for hack.

0bad123

fix gpt-3

0ddac4a

fix typo

152ae54

wawltor reviewed Sep 21, 2023

View reviewed changes

paddlenlp/transformers/model_utils.py Show resolved Hide resolved

fix

d7015d6

wawltor approved these changes Sep 22, 2023

View reviewed changes

ZHUI merged commit 51835e8 into develop Sep 22, 2023

ZHUI deleted the llm/unify_pipeline_model branch September 22, 2023 09:58

DrownFish19 mentioned this pull request Oct 11, 2023

[Question]: 跟着官网的gpt3训练出现乱码。 #7172

Closed

ZHUI mentioned this pull request Jan 2, 2024

PaddleNLP 2.7.0 Release Note Candidate #7753

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[LLM] Unify pipeline model with PretrainModelPipe #7095

[LLM] Unify pipeline model with PretrainModelPipe #7095

ZHUI commented Sep 20, 2023 •

edited

Loading

paddle-bot bot commented Sep 20, 2023

codecov bot commented Sep 20, 2023 •

edited

Loading

wawltor Sep 21, 2023

wawltor Sep 21, 2023

ZHUI Sep 21, 2023

wawltor Sep 21, 2023

wawltor left a comment

[LLM] Unify pipeline model with PretrainModelPipe #7095

[LLM] Unify pipeline model with PretrainModelPipe #7095

Conversation

ZHUI commented Sep 20, 2023 • edited Loading

PR types

PR changes

Description

paddle-bot bot commented Sep 20, 2023

codecov bot commented Sep 20, 2023 • edited Loading

Codecov Report

wawltor Sep 21, 2023

Choose a reason for hiding this comment

wawltor Sep 21, 2023

Choose a reason for hiding this comment

ZHUI Sep 21, 2023

Choose a reason for hiding this comment

wawltor Sep 21, 2023

Choose a reason for hiding this comment

wawltor left a comment

Choose a reason for hiding this comment

ZHUI commented Sep 20, 2023 •

edited

Loading

codecov bot commented Sep 20, 2023 •

edited

Loading