Skip to content

Conversation

@chang-wenbin
Copy link
Collaborator

@chang-wenbin chang-wenbin commented Aug 28, 2025

1、qkv_a_proj horizontal fusion end-to-end acceleration 10%.
2、Support V0 load & V1 load.
3、Fixed v0 load missing bias issue.
4、Added deepseek-v3 model unit test based on v0 load vs v1 load accuracy test.
5、Support DeepSeek-V3 & V3.1 huggingface model loading.

@paddle-bot
Copy link

paddle-bot bot commented Aug 28, 2025

Thanks for your contribution!

@chang-wenbin chang-wenbin changed the title 【Inference Optimize】support MergedReplicatedLinear 【Inference Optimize】Update MergedReplicatedLinear for DSK qkv_a_proj_with_mqa. Aug 28, 2025
zhoutianzi666
zhoutianzi666 previously approved these changes Aug 28, 2025
@codecov-commenter
Copy link

Codecov Report

✅ All modified and coverable lines are covered by tests.
⚠️ Please upload report for BASE (develop@b791bea). Learn more about missing BASE report.

Additional details and impacted files
@@            Coverage Diff             @@
##             develop    #3673   +/-   ##
==========================================
  Coverage           ?   50.00%           
==========================================
  Files              ?        2           
  Lines              ?        4           
  Branches           ?        0           
==========================================
  Hits               ?        2           
  Misses             ?        2           
  Partials           ?        0           
Flag Coverage Δ
diff 50.00% <ø> (?)

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:
  • ❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

bukejiyu
bukejiyu previously approved these changes Sep 1, 2025
@yuanlehome yuanlehome merged commit 41aee08 into PaddlePaddle:develop Sep 5, 2025
25 of 28 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants