feat: Support PEFT weight mapping and merge LoRA adapters when export to hf by HollowMan6 · Pull Request #1310 · NVIDIA-NeMo/Megatron-Bridge

HollowMan6 · 2025-11-12T11:57:25Z

What does this PR do ?

This PR adds support for PEFT weight mapping (to_wrap). The adapter weights themselves are not exported for now; instead, LoRA adapters are merged into the base weights when exporting to HF. The linear_in tensors of the LoRA adapters need to be gathered before the merge is applied through the transformation.
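The merge described above can be illustrated with a minimal, framework-free sketch. It is not the code from this PR: the function names, the plain-list matrices, and the choice to shard linear_in along the rank dimension (dim 0) are assumptions made only for illustration. It applies the standard LoRA merge, W' = W + (alpha / r) * linear_out @ linear_in, after first concatenating the per-rank linear_in shards.

```python
def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def gather_linear_in(shards):
    """Concatenate per-rank linear_in shards along dim 0 (assumed sharding axis)."""
    return [row for shard in shards for row in shard]


def merge_lora(base, linear_in_shards, linear_out, alpha):
    """Return base + (alpha / r) * linear_out @ linear_in (hypothetical helper)."""
    linear_in = gather_linear_in(linear_in_shards)  # shape [r, in_features]
    rank = len(linear_in)
    scale = alpha / rank
    delta = matmul(linear_out, linear_in)  # shape [out_features, in_features]
    return [
        [w + scale * d for w, d in zip(w_row, d_row)]
        for w_row, d_row in zip(base, delta)
    ]


# Toy example: out_features=2, in_features=3, r=2 split across 2 ranks.
base = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]
linear_in_shards = [[[1.0, 2.0, 3.0]], [[0.0, 1.0, 0.0]]]
linear_out = [[1.0, 0.0], [0.0, 2.0]]
merged = merge_lora(base, linear_in_shards, linear_out, alpha=4.0)
```

Gathering first matters because each rank holds only part of linear_in; multiplying with a single shard would produce a delta of the wrong rank rather than the full low-rank update.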

Changelog

Support PEFT weight mapping
Merge LoRA adapters when export to hf

GitHub Actions CI

See the CI section in the Contributing doc for how to trigger the CI. An NVIDIA developer will need to approve and trigger the CI for external contributors.

Before your PR is "Ready for review"

Pre checks:

Make sure you read and followed Contributor guidelines
Did you write any new necessary tests?
Did you add or update any necessary documentation?
Does the PR affect components that are optional to install? (Ex: Numba, Pynini, Apex etc)
- Reviewer: Does the PR have correct import guards for all optional libraries?

If you haven't finished some of the above items, you can still open a "Draft" PR.

Additional Information

Related to [peft] Add support for LoRA merge #418
Resolves Support load_hf_weights and export_hf_weights for PEFT/LoRA enabled models #1272

✨ Presented to you with Mind Lab - A Lab for Experiential Intelligence.

copy-pr-bot · 2025-11-12T11:57:28Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

yaoyu-33 · 2025-11-13T03:23:43Z

@HollowMan6: we plan to do this in two steps. We are going to provide a merging script, and you can then use the regular export to export the merged weights.
Mixing the two together would reduce the readability of the code. Is there a strong reason you need to do it in a single step?

HollowMan6 · 2025-11-13T07:58:19Z

> Is there a strong reason you need to do it in a single step?

@yaoyu-33 Yes, in RL training, it’s better to do it in a single step so that merged model weights can be updated to rollout engines in a streaming manner without any hassle. Please check verl-project/verl#4063 (comment)
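The single-step, streaming export argued for above can be sketched as a generator that merges each weight just before handing it over, so the full merged model is never materialized at once. Everything here is hypothetical: stream_merged_weights, the dictionary layout, and the assumption that each LoRA delta is already scaled and gathered are illustrative and do not reflect the Megatron-Bridge or verl APIs.

```python
from typing import Dict, Iterator, List, Tuple

Matrix = List[List[float]]


def stream_merged_weights(
    base_weights: Dict[str, Matrix],
    lora_deltas: Dict[str, Matrix],
) -> Iterator[Tuple[str, Matrix]]:
    """Yield (name, merged weight) one parameter at a time (hypothetical helper)."""
    for name, weight in base_weights.items():
        delta = lora_deltas.get(name)
        if delta is None:
            # No adapter on this parameter: pass the base weight through unchanged.
            yield name, weight
        else:
            # Merge lazily; delta is assumed to be the already-scaled LoRA update.
            yield name, [
                [w + d for w, d in zip(w_row, d_row)]
                for w_row, d_row in zip(weight, delta)
            ]


base_weights = {
    "layer0.qkv": [[1.0, 2.0], [3.0, 4.0]],
    "layer0.norm": [[1.0, 1.0]],
}
lora_deltas = {"layer0.qkv": [[0.5, 0.0], [0.0, -1.0]]}

# Stand-in for pushing each tensor to a rollout engine as soon as it is ready.
received = {}
for name, tensor in stream_merged_weights(base_weights, lora_deltas):
    received[name] = tensor
```

With a separate merge script, the consumer would have to wait for a fully merged checkpoint; with the generator, each parameter can be sent as soon as it is merged.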

yaoyu-33 · 2025-11-13T18:38:31Z

@HollowMan6: we understand the need now and can help you merge this.
Please check our comments and see if they make sense to you.

cuichenx

Thanks for your contribution! The overall logic and structure look good to me. Some comments below. Could you also add a unit test that covers the newly added code in model bridge?

src/megatron/bridge/models/conversion/model_bridge.py

src/megatron/bridge/peft/lora.py

src/megatron/bridge/models/conversion/model_bridge.py

src/megatron/bridge/models/conversion/param_mapping.py

… to hf Signed-off-by: Hollow Man <hollowman@opensuse.org>

HollowMan6 · 2025-11-13T22:40:56Z

Thank you for reviewing @cuichenx and @yaoyu-33 ! I have just addressed all your change requests and added straightforward unit tests for this feature. Please let me know if you still have any other concerns or suggestions, thanks!

cuichenx

LGTM

cuichenx · 2025-11-14T00:17:23Z

/ok to test 01599fc

Signed-off-by: Chen Cui <chcui@nvidia.com>

cuichenx · 2025-11-14T05:52:05Z

/ok to test 33eeac6

… to hf (#1310) Signed-off-by: Hollow Man <hollowman@opensuse.org> Signed-off-by: Chen Cui <chcui@nvidia.com> Co-authored-by: Chen Cui <chcui@nvidia.com> Signed-off-by: NeMo Bot <nemo-bot@nvidia.com>

… to hf (NVIDIA-NeMo#1310) Signed-off-by: Hollow Man <hollowman@opensuse.org> Signed-off-by: Chen Cui <chcui@nvidia.com> Co-authored-by: Chen Cui <chcui@nvidia.com>

github-actions bot added the community-request label Nov 12, 2025

HollowMan6 mentioned this pull request Nov 12, 2025

[megatron] feat: Integrate Megatron-Bridge and support LoRA/PEFT verl-project/verl#4063

Merged

7 tasks

ananthsub requested review from cuichenx and yaoyu-33 November 12, 2025 13:32

cuichenx reviewed Nov 13, 2025

ananthsub linked an issue Nov 13, 2025 that may be closed by this pull request

Add hf-peft-exporting for GPT OSS model recipes #448

Closed

feat: Support PEFT weight mapping and merge LoRA adapters when export…

01599fc

… to hf Signed-off-by: Hollow Man <hollowman@opensuse.org>

HollowMan6 requested a review from cuichenx November 13, 2025 22:42

cuichenx previously approved these changes Nov 14, 2025

ananthsub added the r0.2.0 label (Cherry-pick label for r0.2.0 release branch) Nov 14, 2025

copy-pr-bot bot temporarily deployed to nemo-ci November 14, 2025 00:17 Inactive

lint

8191df9

Signed-off-by: Chen Cui <chcui@nvidia.com>

cuichenx dismissed their stale review via 8191df9 November 14, 2025 05:51

copyright

33eeac6

Signed-off-by: Chen Cui <chcui@nvidia.com>

copy-pr-bot bot temporarily deployed to nemo-ci November 14, 2025 05:52 Inactive

copy-pr-bot bot temporarily deployed to test November 14, 2025 05:52 Inactive

copy-pr-bot bot temporarily deployed to nemo-ci November 14, 2025 06:25 Inactive

copy-pr-bot bot temporarily deployed to nemo-ci November 14, 2025 06:27 Inactive

copy-pr-bot bot temporarily deployed to nemo-ci November 14, 2025 06:32 Inactive

copy-pr-bot bot temporarily deployed to nemo-ci November 14, 2025 17:08 Inactive

copy-pr-bot bot temporarily deployed to nemo-ci November 14, 2025 17:09 Inactive

copy-pr-bot bot temporarily deployed to nemo-ci November 14, 2025 17:14 Inactive

cuichenx approved these changes Nov 14, 2025

cuichenx merged commit 42d856c into NVIDIA-NeMo:main Nov 14, 2025
42 checks passed

HollowMan6 deleted the lora_merge branch November 14, 2025 18:56

HollowMan6 mentioned this pull request Nov 16, 2025

MoE LoRA on Expert Modules (fc1/fc2) + EP Causes Hang #1363

Closed

HollowMan6 mentioned this pull request Dec 5, 2025

[LoRA] Fix LoRA merge and support CanonicalLoRA merge #1603

Merged

5 tasks

cuichenx merged 4 commits into NVIDIA-NeMo:main from HollowMan6:lora_merge


5 participants
