@scsudhakaran can you remove the draft tag?
e7667ec to 3bdd232
Walkthrough
The changes modify the DeepSeek V3 pretraining configurations for H100 GPUs by adjusting the tensor and pipeline model parallelism settings, conditionally propagating pipeline layout configurations, and adding environment variable configuration targeted at specific hardware and model combinations.
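For readers skimming the thread, the three kinds of change named in the walkthrough can be sketched roughly as below. This is only an illustration under stated assumptions, not the code in this PR: `ParallelismConfig`, `apply_overrides`, `targeted_env`, the `EXAMPLE_TUNING_FLAG` variable, and the numeric values are all hypothetical placeholders.

```python
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass
class ParallelismConfig:
    """Hypothetical stand-in for a recipe's parallelism settings."""
    tensor_model_parallel_size: int = 1
    pipeline_model_parallel_size: int = 1
    pipeline_layout: Optional[str] = None


def apply_overrides(cfg: ParallelismConfig, tp: int, pp: int,
                    layout: Optional[str] = None) -> ParallelismConfig:
    """Set TP/PP sizes; propagate the pipeline layout only when one is given."""
    cfg.tensor_model_parallel_size = tp
    cfg.pipeline_model_parallel_size = pp
    if layout is not None:
        # Conditional propagation: an unset layout leaves the default untouched.
        cfg.pipeline_layout = layout
    return cfg


def targeted_env(gpu: str, model: str) -> Dict[str, str]:
    """Return extra environment variables for one hardware/model pair only."""
    env: Dict[str, str] = {}
    if gpu.lower() == "h100" and model.lower() == "deepseek_v3":
        # Placeholder variable name; the real PR may set something different.
        env["EXAMPLE_TUNING_FLAG"] = "1"
    return env


# Illustrative values only; they are not taken from the PR.
cfg = apply_overrides(ParallelismConfig(), tp=2, pp=8)
extra_env = targeted_env("h100", "deepseek_v3")
```

Keeping the environment override behind an explicit hardware and model check means other recipes are unaffected by it, which is the point of a "targeted" setting.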
Estimated code review effort: 3 (Moderate), ~20 minutes
Pre-merge checks: 3 passed, 1 failed (1 warning)
No actionable comments were generated in the recent review.
/ok to test 3bdd232
Signed-off-by: Sanju C Sudhakaran <scsudhakaran@nvidia.com>
3bdd232 to 724e50c
/ok to test 724e50c
Signed-off-by: Sanju C Sudhakaran <scsudhakaran@nvidia.com>
Signed-off-by: sowmen <sowmendipta@gmail.com>
This PR updates the DeepSeek-V3 H100 recipes with a configuration that provides better performance numbers.