
LLAMA3 70B: LoRa enabled in all modules instead of only LinearQKV #2181

Merged
ko3n1g merged 8 commits into main from rhmukundan/llama3_70b_qkv_only_lora
Feb 10, 2026

Conversation

@rhmukundan
Contributor

@rhmukundan rhmukundan commented Feb 2, 2026

Disable LoRA in linear_proj, linear_fc1, and linear_fc2, retaining it solely in linear_qkv.

Summary by CodeRabbit

  • Chores
    • Refined Llama 3 70B LoRA fine-tuning configuration to target specific model components, improving precision and efficiency during model adaptation.
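The change described above can be sketched roughly as follows. This is a hypothetical illustration, not the actual diff: the real override lives in scripts/performance/configs/llama/llama3_llm_finetune.py, and the `LoRAConfig` field names used here are assumptions modeled on NeMo-style PEFT configs.

```python
from dataclasses import dataclass, field
from typing import List

# Hypothetical PEFT config sketch; field names are assumptions modeled
# on NeMo-style LoRA configs, not the actual classes in
# scripts/performance/configs/llama/llama3_llm_finetune.py.
@dataclass
class LoRAConfig:
    dim: int = 16
    alpha: int = 32
    # Before this change: adapters on all four linear layers.
    target_modules: List[str] = field(
        default_factory=lambda: [
            "linear_qkv", "linear_proj", "linear_fc1", "linear_fc2",
        ]
    )

def restrict_to_qkv(cfg: LoRAConfig) -> LoRAConfig:
    # The override this PR applies: keep LoRA only on the fused
    # QKV projection, dropping the MLP and output projections.
    cfg.target_modules = ["linear_qkv"]
    return cfg

cfg = restrict_to_qkv(LoRAConfig())
print(cfg.target_modules)  # -> ['linear_qkv']
```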

Signed-off-by: Raghav Hrishikeshan Mukundan <rmukundan@nvidia.com>
@rhmukundan rhmukundan self-assigned this Feb 2, 2026
@copy-pr-bot

copy-pr-bot bot commented Feb 2, 2026

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@coderabbitai
Contributor

coderabbitai bot commented Feb 2, 2026

📝 Walkthrough

Walkthrough

This change adds a LoRA configuration override to three Llama 3 70B model configuration functions, constraining PEFT target_modules to QKV (Query, Key, Value) modules. The modification is a straightforward configuration addition with no control flow changes.

Changes

LoRA Configuration Override
  • scripts/performance/configs/llama/llama3_llm_finetune.py — Adds target_modules constraint to QKV across the gb300, gb200, and h100 Llama 3 70B LoRA configuration functions, refining PEFT application scope.
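As a rough sketch of applying the same override across the three hardware-specific config functions: the function and field names below are illustrative assumptions, not the real identifiers in llama3_llm_finetune.py.

```python
# Illustrative only: applying the same target_modules override in each
# of the three hardware-specific Llama 3 70B LoRA config functions
# (gb300, gb200, h100). Real function and field names may differ.

def _with_qkv_only(recipe: dict) -> dict:
    # Shared helper so the override stays consistent across variants.
    recipe["peft"] = dict(recipe.get("peft", {}), target_modules=["linear_qkv"])
    return recipe

def llama3_70b_lora_gb300() -> dict:
    return _with_qkv_only({"model": "llama3-70b", "hw": "gb300"})

def llama3_70b_lora_gb200() -> dict:
    return _with_qkv_only({"model": "llama3-70b", "hw": "gb200"})

def llama3_70b_lora_h100() -> dict:
    return _with_qkv_only({"model": "llama3-70b", "hw": "h100"})

for fn in (llama3_70b_lora_gb300, llama3_70b_lora_gb200, llama3_70b_lora_h100):
    assert fn()["peft"]["target_modules"] == ["linear_qkv"]
```

Factoring the override into one helper keeps the three variants from drifting apart, which is the practical risk when the same constraint is pasted into each function.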

Estimated code review effort

🎯 2 (Simple) | ⏱️ ~8 minutes

🚥 Pre-merge checks | ✅ 2 | ❌ 2
❌ Failed checks (2 warnings)
  • Title check ⚠️ Warning — The PR title claims LoRa is being enabled in all modules, but the description and changes clarify it's actually being constrained to only linear_qkv, contradicting the title's assertion. Resolution: revise the title to accurately reflect the change, such as "LLAMA3 70B: LoRa configuration constrained to only linear_qkv modules" or similar phrasing that matches the actual implementation.
  • Test Results For Major Changes ⚠️ Warning — PR contains a significant LoRA configuration change but lacks documentation of test results, convergence metrics, or performance validation. Resolution: add test results to the PR description, including convergence behavior, before-and-after loss curves, and performance impact with specific testing configuration details.
✅ Passed checks (2 passed)
  • Description Check ✅ Passed — Check skipped; CodeRabbit's high-level summary is enabled.
  • Docstring Coverage ✅ Passed — Docstring coverage is 100.00%, which is sufficient. The required threshold is 80.00%.


@erhoo82 erhoo82 added the r0.3.0 Cherry-pick label for r0.3.0 release branch label Feb 3, 2026
@erhoo82 erhoo82 added this to the 26.02 milestone Feb 3, 2026
@rhmukundan rhmukundan enabled auto-merge (squash) February 4, 2026 17:11
@rhmukundan
Contributor Author

/ok to test b22d740

@rhmukundan
Contributor Author

/ok to test 04a9849

@rhmukundan
Contributor Author

/ok to test 71db322


Labels

r0.3.0 Cherry-pick label for r0.3.0 release branch


4 participants