Skip to content

cp: fix: use local_rank (#2328)#2329

Closed
ko3n1g wants to merge 34 commits intomainfrom
r0.3.0
Closed

cp: fix: use local_rank (#2328)#2329
ko3n1g wants to merge 34 commits intomainfrom
r0.3.0

Conversation

@ko3n1g
Copy link
Copy Markdown
Contributor

@ko3n1g ko3n1g commented Feb 11, 2026

What does this PR do ?

Add a one line overview of what this PR aims to accomplish.

Changelog

  • Add specific line by line info of high level changes in this PR.

GitHub Actions CI

See the CI sectionin the Contributing doc for how to trigger the CI. A Nvidia developer will need to approve and trigger the CI for external contributors.

Before your PR is "Ready for review"

Pre checks:

  • Make sure you read and followed Contributor guidelines
  • Did you write any new necessary tests?
  • Did you add or update any necessary documentation?
  • Does the PR affect components that are optional to install? (Ex: Numba, Pynini, Apex etc)
    • Reviewer: Does the PR have correct import guards for all optional libraries?

If you haven't finished some of the above items you can still open "Draft" PR.

Additional Information

  • Related to # (issue)

Summary by CodeRabbit

  • New Features

    • Added Multi-Token Prediction documentation with configuration guidance and examples.
    • Added Ministral3 Vision-Language Model support with examples and recipes.
    • Added GLM-4.5V examples with conversion, inference, and finetuning scripts.
    • Introduced packed sequence support for vision-language model training with validation.
    • Added PEFT (LoRA/DoRA) finetuning for Qwen3-VL models.
  • Bug Fixes

    • Fixed VLM forward pass compatibility for multiple return types.
    • Fixed Ministral3 image feature extraction to use pooler output.
    • Fixed inference wrapper decoder exposure for Qwen models.
    • Addressed CVE-2025-68973 in Docker image.
  • Documentation

    • Updated release version to 0.3.0.
    • Enhanced GLM-4.5V and Qwen3-VL documentation with PEFT examples.
    • Updated documentation links and references for vision-language models.
  • Tests

    • Added packed sequence finetuning tests for multiple models.
    • Added Qwen3-VL finetuning test suite.
    • Added validation tests for packed sequence configurations.
  • Chores

    • Updated dependencies and GitHub workflows.
    • Updated Megatron-LM submodule.
    • Adjusted parallelism configurations for performance tuning.
    • Enhanced shell scripts for model workflows.

ko3n1g and others added 30 commits February 2, 2026 15:52
Signed-off-by: oliver könig <okoenig@nvidia.com>
Signed-off-by: Dingqing Yang <dingqingy@nvidia.com>
Signed-off-by: NeMo Bot <nemo-bot@nvidia.com>
Co-authored-by: Dingqing Yang <dingqingy@nvidia.com>
Signed-off-by: Chen Cui <chcui@nvidia.com>
Signed-off-by: NeMo Bot <nemo-bot@nvidia.com>
Co-authored-by: Chen Cui <chcui@nvidia.com>
…300 FP8-CS (2175)` into `r0.3.0` (#2198)

Signed-off-by: Malay Nagda <malayn@nvidia.com>
Signed-off-by: NeMo Bot <nemo-bot@nvidia.com>
Co-authored-by: malay-nagda <malayn@nvidia.com>
Signed-off-by: Chen Cui <chcui@nvidia.com>
Signed-off-by: Chen Cui <cxcui@alumni.cmu.edu>
Signed-off-by: NeMo Bot <nemo-bot@nvidia.com>
Co-authored-by: Chen Cui <chcui@nvidia.com>
Co-authored-by: coderabbitai[bot] <136622811+coderabbitai[bot]@users.noreply.github.com>
Signed-off-by: NeMo Bot <nemo-bot@nvidia.com>
Co-authored-by: Yashaswi Karnati <144376261+yashaswikarnati@users.noreply.github.com>
…L docs (2151)` into `r0.3.0` (#2226)

Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>
Signed-off-by: Ao Tang <aot@nvidia.com>
Signed-off-by: Ananth Subramaniam <ansubramania@nvidia.com>
Signed-off-by: oliver könig <okoenig@nvidia.com>
Signed-off-by: Abhishree <abhishreetm@gmail.com>
Signed-off-by: Dingqing Yang <dingqingy@nvidia.com>
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Signed-off-by: Chen Cui <chcui@nvidia.com>
Signed-off-by: Malay Nagda <malayn@nvidia.com>
Signed-off-by: Charlie Truong <chtruong@nvidia.com>
Signed-off-by: Yu Yao <54727607+yaoyu-33@users.noreply.github.com>
Signed-off-by: NeMo Bot <nemo-bot@nvidia.com>
Co-authored-by: Yu Yao <54727607+yaoyu-33@users.noreply.github.com>
Co-authored-by: Ao Tang <aot@nvidia.com>
Co-authored-by: Ananth Subramaniam <ansubramania@nvidia.com>
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: Abhishree Thittenamane <47577437+athitten@users.noreply.github.com>
Co-authored-by: Dingqing Yang <dingqingy@nvidia.com>
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: Chen Cui <chcui@nvidia.com>
Co-authored-by: malay-nagda <malayn@nvidia.com>
Co-authored-by: Charlie Truong <chtruong@nvidia.com>
)

Signed-off-by: Kamran Jafari <kjafarisadeg@nvidia.com>
Signed-off-by: NeMo Bot <nemo-bot@nvidia.com>
Co-authored-by: kamran-nvidia <kjafarisadeg@nvidia.com>
Signed-off-by: oliver könig <okoenig@nvidia.com>
Signed-off-by: NeMo Bot <nemo-bot@nvidia.com>
Signed-off-by: oliver könig <okoenig@nvidia.com>
Signed-off-by: NeMo Bot <nemo-bot@nvidia.com>
… `r0.3.0` (#2205)

Signed-off-by: NeMo Bot <nemo-bot@nvidia.com>
Co-authored-by: meatybobby <meatybobby@gmail.com>
Signed-off-by: oliver könig <okoenig@nvidia.com>
Signed-off-by: NeMo Bot <nemo-bot@nvidia.com>
Signed-off-by: Malay Nagda <malayn@nvidia.com>
Signed-off-by: NeMo Bot <nemo-bot@nvidia.com>
Co-authored-by: malay-nagda <malayn@nvidia.com>
Signed-off-by: Youngeun Kwon <youngeunk@nvidia.com>
Signed-off-by: NeMo Bot <nemo-bot@nvidia.com>
Co-authored-by: Youngeun Kwon <youngeunk@nvidia.com>
…ad norm (2209)` into `r0.3.0` (#2210)

Signed-off-by: Dingqing Yang <dingqingy@nvidia.com>
Signed-off-by: NeMo Bot <nemo-bot@nvidia.com>
Co-authored-by: Dingqing Yang <dingqingy@nvidia.com>
Signed-off-by: Malay Nagda <malayn@nvidia.com>
Signed-off-by: NeMo Bot <nemo-bot@nvidia.com>
Co-authored-by: malay-nagda <malayn@nvidia.com>
Signed-off-by: oliver könig <okoenig@nvidia.com>
Co-authored-by: yaoyu-33 <yaoyu.094@gmail.com>
Signed-off-by: oliver könig <okoenig@nvidia.com>
Signed-off-by: oliver könig <okoenig@nvidia.com>
…e NaN grad norm (2209)` into `r0.3.0` (#2210)"

This reverts commit d7a13b1.
…ve NaN grad norm (2209)` into `r0.3.0` (#2210)"

This reverts commit 34aec47.
…nd-2209

Ko3n1g/chore/reapply 2152 and 2209
Signed-off-by: Malay Nagda <malayn@nvidia.com>
Co-authored-by: malay-nagda <malayn@nvidia.com>
Signed-off-by: oliver könig <okoenig@nvidia.com>
Signed-off-by: oliver könig <okoenig@nvidia.com>
…d for example (2283)` into `r0.3.0` (#2291)

Signed-off-by: Ananth Subramaniam <ansubramania@nvidia.com>
Signed-off-by: NeMo Bot <nemo-bot@nvidia.com>
Co-authored-by: Ananth Subramaniam <ansubramania@nvidia.com>
Signed-off-by: oliver könig <okoenig@nvidia.com>
Signed-off-by: NeMo Bot <nemo-bot@nvidia.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant