cp: fix: Fix crash when using cp in dtensor path (1663) into r0.5.0 #1665
Signed-off-by: Yi-Fu Wu <yifu.wu@gmail.com>
Signed-off-by: NeMo Bot <nemo-bot@nvidia.com>
Walkthrough: Modified DTensorPolicyWorkerV2 to support context-parallel SDPA handling. When context parallelism is enabled (cp_size > 1), the code imports SDPBackend and builds an sdpa_method list containing the FLASH_ATTENTION and EFFICIENT_ATTENTION backends, then passes this list to model construction.
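The backend selection described in the walkthrough can be sketched roughly as follows. This is a minimal illustration, not the code from the PR: the helper name `select_sdpa_backends` is hypothetical, returning `None` when context parallelism is off is an assumption (the walkthrough does not say what happens in that case), and the fallback enum exists only so the sketch runs without PyTorch installed. `SDPBackend` itself is the enum exposed by `torch.nn.attention`.

```python
from enum import Enum
from typing import Optional

try:
    # Real enum shipped with PyTorch.
    from torch.nn.attention import SDPBackend
except ImportError:
    # Stand-in for environments without PyTorch; it only mirrors the
    # two member names used below and is NOT the real enum.
    class SDPBackend(Enum):
        FLASH_ATTENTION = "flash_attention"
        EFFICIENT_ATTENTION = "efficient_attention"


def select_sdpa_backends(cp_size: int) -> Optional[list]:
    """Hypothetical helper: choose SDPA backends from the context-parallel size.

    Returns the two backends named in the walkthrough when cp_size > 1.
    Returning None otherwise is an assumption made for this sketch.
    """
    if cp_size > 1:
        return [SDPBackend.FLASH_ATTENTION, SDPBackend.EFFICIENT_ATTENTION]
    return None


# Usage: with context parallelism across 2 ranks, both backends are listed.
sdpa_method = select_sdpa_backends(cp_size=2)
print([backend.name for backend in sdpa_method])
```

In the worker, the resulting list would then be handed to model construction; the exact argument it is passed through is not shown in the walkthrough, so it is left out here.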
Estimated code review effort: 3 (Moderate), ~20 minutes
Pre-merge checks: 4 passed