Revert packed seq extra checks by cuichenx · Pull Request #2180 · NVIDIA-NeMo/Megatron-Bridge

cuichenx · 2026-02-02T21:49:58Z

What does this PR do ?

Packed sequence changes in #1997 resulted in perf degradation.

Changelog

Add specific line by line info of high level changes in this PR.

GitHub Actions CI

See the CI sectionin the Contributing doc for how to trigger the CI. A Nvidia developer will need to approve and trigger the CI for external contributors.

Before your PR is "Ready for review"

Pre checks:

Make sure you read and followed Contributor guidelines
Did you write any new necessary tests?
Did you add or update any necessary documentation?
Does the PR affect components that are optional to install? (Ex: Numba, Pynini, Apex etc)
- Reviewer: Does the PR have correct import guards for all optional libraries?

If you haven't finished some of the above items you can still open "Draft" PR.

Additional Information

Related to # (issue)

Summary by CodeRabbit

Refactor
- Simplified internal sequence handling logic by streamlining the slicing pattern, reducing dependency on padding assumptions and improving code clarity.

Signed-off-by: Chen Cui <chcui@nvidia.com>

coderabbitai · 2026-02-02T21:54:36Z

📝 Walkthrough

Walkthrough

Refactors slicing logic for cu_seqlens sequences in packed sequence utilities to use argmin-based approach instead of index-based slicing. Replaces conditional branches checking for -1 padding with unified argmin pattern, reducing dependency on padding assumptions and simplifying control flow.

Changes

Cohort / File(s)	Summary
Packed Sequence Slicing Refactor `src/megatron/bridge/training/utils/packed_seq_utils.py`	Reworks slicing logic for cu_seqlens_padded and cu_seqlens_unpadded to use argmin-based approach directly when indicators are present; removes conditional min==-1 branches and explicit assertions; unifies control flow to consistent argmin-based pattern.

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~20 minutes

🚥 Pre-merge checks | ✅ 3 | ❌ 1

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Test Results For Major Changes	⚠️ Warning	PR reverts changes due to performance degradation but lacks required performance benchmarking results, before-and-after metrics, test validation, and incomplete pre-check checklist.	Add performance benchmarking with before-and-after metrics, include test results validating packed sequence slicing logic, and complete pre-check testing checklist.

✅ Passed checks (3 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title 'Revert packed seq extra checks' accurately describes the main change—reverting recent packed sequence modifications from PR `#1997` that caused performance issues.
Docstring Coverage	✅ Passed	Docstring coverage is 100.00% which is sufficient. The required threshold is 80.00%.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing touches

📝 Generate docstrings

🧪 Generate unit tests (beta)

Create PR with unit tests
Post copyable unit tests in a comment
Commit unit tests in branch chcui/revert_packed_seq

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

Actionable comments posted: 1

🤖 Fix all issues with AI agents

In `@src/megatron/bridge/training/utils/packed_seq_utils.py`:
- Around line 46-55: The current slicing using torch.argmin on cu_seqlens_padded
and cu_seqlens_unpadded can produce [:0] when there is no -1 marker (min at
index 0); modify the logic in the block handling cu_seqlens_padded and
cu_seqlens_unpadded so that when the corresponding argmin indicator is None you
first check the minimum value (e.g. via torch.min(...).item() or tensor.min())
and only perform the slice when that minimum is negative, otherwise leave the
tensor unchanged; update the branches that reference cu_seqlens_argmin,
cu_seqlens_unpadded_argmin, torch.argmin, cu_seqlens_padded and
cu_seqlens_unpadded accordingly.

src/megatron/bridge/training/utils/packed_seq_utils.py

cuichenx · 2026-02-02T23:39:11Z

/ok to test 87c2d06

cuichenx · 2026-02-03T05:35:15Z

/ok to test 87c2d06

Signed-off-by: Chen Cui <chcui@nvidia.com>

Signed-off-by: Chen Cui <chcui@nvidia.com> Signed-off-by: NeMo Bot <nemo-bot@nvidia.com>

Signed-off-by: Chen Cui <chcui@nvidia.com>

Signed-off-by: Chen Cui <chcui@nvidia.com> Signed-off-by: sowmen <sowmendipta@gmail.com>

revert packed seq extra checks

87c2d06

Signed-off-by: Chen Cui <chcui@nvidia.com>

cuichenx added the r0.3.0 Cherry-pick label for r0.3.0 release branch label Feb 2, 2026

coderabbitai bot reviewed Feb 2, 2026

View reviewed changes

src/megatron/bridge/training/utils/packed_seq_utils.py Show resolved Hide resolved

cuichenx requested a review from yaoyu-33 February 2, 2026 22:09

copy-pr-bot bot temporarily deployed to test February 3, 2026 05:36 Inactive

copy-pr-bot bot temporarily deployed to nemo-ci February 3, 2026 05:58 Inactive

copy-pr-bot bot had a problem deploying to nemo-ci February 3, 2026 06:05 Failure

remove invalid test cases..

18a6463

Signed-off-by: Chen Cui <chcui@nvidia.com>

copy-pr-bot bot temporarily deployed to nemo-ci February 3, 2026 06:39 Inactive

copy-pr-bot bot temporarily deployed to test February 3, 2026 06:39 Inactive

copy-pr-bot bot temporarily deployed to nemo-ci February 3, 2026 06:42 Inactive

copy-pr-bot bot temporarily deployed to nemo-ci February 3, 2026 06:50 Inactive

copy-pr-bot bot temporarily deployed to nemo-ci February 3, 2026 06:59 Inactive

copy-pr-bot bot had a problem deploying to nemo-ci February 3, 2026 06:59 Failure

copy-pr-bot bot temporarily deployed to nemo-ci February 3, 2026 06:59 Inactive

copy-pr-bot bot had a problem deploying to nemo-ci February 3, 2026 06:59 Failure

copy-pr-bot bot temporarily deployed to nemo-ci February 3, 2026 06:59 Inactive

ko3n1g approved these changes Feb 3, 2026

View reviewed changes

copy-pr-bot bot temporarily deployed to nemo-ci February 3, 2026 17:36 Inactive

cuichenx merged commit 18c66e3 into main Feb 3, 2026
82 of 85 checks passed

cuichenx deleted the chcui/revert_packed_seq branch February 3, 2026 18:06

ko3n1g pushed a commit that referenced this pull request Feb 3, 2026

Revert packed seq extra checks (#2180)

75f3195

Signed-off-by: Chen Cui <chcui@nvidia.com> Signed-off-by: NeMo Bot <nemo-bot@nvidia.com>

yaoyu-33 pushed a commit that referenced this pull request Feb 3, 2026

Revert packed seq extra checks (#2180)

8866c35

Signed-off-by: Chen Cui <chcui@nvidia.com>

coderabbitai bot mentioned this pull request Feb 5, 2026

[training] fix: Add cu_seqlens_argmin to vlm packed sequence #2246

Merged

5 tasks

sowmen pushed a commit to sowmen/Megatron-Bridge that referenced this pull request Feb 11, 2026

Revert packed seq extra checks (NVIDIA-NeMo#2180)

17ad99f

Signed-off-by: Chen Cui <chcui@nvidia.com> Signed-off-by: sowmen <sowmendipta@gmail.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Revert packed seq extra checks#2180

Revert packed seq extra checks#2180
cuichenx merged 2 commits intomainfrom
chcui/revert_packed_seq

cuichenx commented Feb 2, 2026 •

edited by coderabbitai bot

Loading

Uh oh!

coderabbitai bot commented Feb 2, 2026

Walkthrough

Changes

Estimated code review effort

Uh oh!

coderabbitai bot left a comment

Uh oh!

Uh oh!

cuichenx commented Feb 2, 2026

Uh oh!

cuichenx commented Feb 3, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

cuichenx commented Feb 2, 2026 • edited by coderabbitai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do ?

Changelog

GitHub Actions CI

Before your PR is "Ready for review"

Additional Information

Summary by CodeRabbit

Uh oh!

coderabbitai bot commented Feb 2, 2026

Walkthrough

Changes

Estimated code review effort

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

cuichenx commented Feb 2, 2026

Uh oh!

cuichenx commented Feb 3, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

cuichenx commented Feb 2, 2026 •

edited by coderabbitai bot

Loading