[Llama2] Fix gradient accumulation for Llama2 training in auto model and add uts #7625

haohongxiang · 2023-12-11T12:51:24Z

PR types

Bug fixes

PR changes

Others

Description

[Llama2] Fix gradient accumulation for Llama2 training in auto model and add uts

paddle-bot · 2023-12-11T12:51:30Z

Thanks for your contribution!

codecov · 2023-12-11T13:29:03Z

Codecov Report

Attention: 2 lines in your changes are missing coverage. Please review.

Comparison is base (e6acb0e) 57.85% compared to head (2119b50) 57.85%.

Files	Patch %	Lines
paddlenlp/trainer/training_args.py	66.66%	2 Missing ⚠️

Additional details and impacted files

@@             Coverage Diff             @@
##           develop    #7625      +/-   ##
===========================================
- Coverage    57.85%   57.85%   -0.01%     
===========================================
  Files          582      582              
  Lines        86485    86489       +4     
===========================================
+ Hits         50038    50040       +2     
- Misses       36447    36449       +2

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

zhaoyinglia

LGTM

ZHUI · 2023-12-12T02:55:14Z

paddlenlp/trainer/training_args.py

            return max(self.sharding_parallel_degree, 1) * self.data_parallel_rank + self.sharding_parallel_rank
+        elif self.use_auto_parallel:
+            return self.data_parallel_rank


自动并行没有sharding么?

和动手有区别，自动并行的sharding是数据并行的一个优化，所以在取数据并行相关的维度时不需要考虑sharding维度

haohongxiang force-pushed the fix_grad_acc_in_llama_auto branch 5 times, most recently from bdfd12a to d7ba41b Compare December 11, 2023 23:51

fix grads acc in llama auto

2119b50

haohongxiang force-pushed the fix_grad_acc_in_llama_auto branch from d7ba41b to 2119b50 Compare December 11, 2023 23:58

zhaoyinglia approved these changes Dec 12, 2023

View reviewed changes

zhaoyinglia merged commit a4ed7ac into PaddlePaddle:develop Dec 12, 2023
9 checks passed

ZHUI reviewed Dec 12, 2023

View reviewed changes

ZHUI mentioned this pull request Jan 2, 2024

PaddleNLP 2.7.0 Release Note Candidate #7753

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Llama2] Fix gradient accumulation for Llama2 training in auto model and add uts #7625

[Llama2] Fix gradient accumulation for Llama2 training in auto model and add uts #7625

haohongxiang commented Dec 11, 2023

paddle-bot bot commented Dec 11, 2023

codecov bot commented Dec 11, 2023 •

edited

Loading

zhaoyinglia left a comment

ZHUI Dec 12, 2023

zhaoyinglia Dec 12, 2023

[Llama2] Fix gradient accumulation for Llama2 training in auto model and add uts #7625

[Llama2] Fix gradient accumulation for Llama2 training in auto model and add uts #7625

Conversation

haohongxiang commented Dec 11, 2023

PR types

PR changes

Description

paddle-bot bot commented Dec 11, 2023

codecov bot commented Dec 11, 2023 • edited Loading

Codecov Report

zhaoyinglia left a comment

Choose a reason for hiding this comment

ZHUI Dec 12, 2023

Choose a reason for hiding this comment

zhaoyinglia Dec 12, 2023

Choose a reason for hiding this comment

codecov bot commented Dec 11, 2023 •

edited

Loading