Glm4Moe: fix attn_mask && fused_loss #2648
base: develop
Conversation
Thanks for your contribution!
Codecov Report

❌ Patch coverage is 0.00%. Your patch status has failed because the patch coverage (0.00%) is below the target coverage (80.00%). You can increase the patch coverage or adjust the target coverage.

@@           Coverage Diff            @@
##           develop    #2648   +/-  ##
=========================================
  Coverage         ?   29.52%
=========================================
  Files            ?      311
  Lines            ?    54249
  Branches         ?        0
=========================================
  Hits             ?    16017
  Misses           ?    38232
  Partials         ?        0
=========================================

View full report in Codecov by Sentry.
model_config.num_nextn_predict_layers = model_args.num_nextn_predict_layers
model_config._attn_implementation = model_args.attn_impl
if training_args.use_expert_parallel and training_args.expert_parallel_degree >= 1:
    model_config.n_group = training_args.expert_parallel_degree
model_config.n_group: not every model names this attribute n_group; consider how to handle this more generically — see the sketch below.
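A minimal sketch of one way to generalize this, assuming different model configs may expose the expert-group count under different names. Only `n_group` appears in this PR; the other candidate attribute name and the helper `set_expert_parallel_group` are illustrative, not part of the codebase.

```python
# Hypothetical helper; attribute names other than "n_group" are assumptions
# for illustration, not names confirmed by this PR.
_EP_GROUP_ATTR_CANDIDATES = ("n_group", "num_expert_groups")

def set_expert_parallel_group(model_config, training_args):
    """Write the expert-parallel degree to whichever group attribute the config defines."""
    if not (training_args.use_expert_parallel and training_args.expert_parallel_degree >= 1):
        return
    for attr in _EP_GROUP_ATTR_CANDIDATES:
        if hasattr(model_config, attr):
            setattr(model_config, attr, training_args.expert_parallel_degree)
            return
    # No known attribute found: leave the config unchanged rather than guessing.
```

This keeps per-model naming out of the training script: supporting another architecture only means extending the candidate list (or, alternatively, mapping `model_config.model_type` to the right attribute name).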
Changes: