chore: Update vLLM compilation config to vLLM v0.14.1#5819
chore: Update vLLM compilation config to vLLM v0.14.1#5819ptarasiewiczNV merged 2 commits intomainfrom
Conversation
Signed-off-by: Piotr Tarasiewicz <ptarasiewicz@nvidia.com>
WalkthroughVLLM compilation configuration flags in the deploy_hopper_16gpu.yaml file were updated. The previous flags (enable_fusion, enable_attn_fusion, enable_noop, and custom_ops with rms_norm) were replaced with fuse_norm_quant and eliminate_noops flags while preserving cudagraph_mode. Changes
Estimated code review effort🎯 1 (Trivial) | ⏱️ ~3 minutes Poem
🚥 Pre-merge checks | ✅ 2 | ❌ 1❌ Failed checks (1 warning)
✅ Passed checks (2 passed)
✏️ Tip: You can configure your own custom pre-merge checks in the settings. Tip 🧪 Unit Test Generation v2 is now available!We have significantly improved our unit test generation capabilities. To enable: Add this to your reviews:
finishing_touches:
unit_tests:
enabled: trueTry it out by using the Have feedback? Share your thoughts on our Discord thread! Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out. Comment |
Signed-off-by: Piotr Tarasiewicz <ptarasiewicz@nvidia.com>
Signed-off-by: Piotr Tarasiewicz <ptarasiewicz@nvidia.com>
Overview:
Update compilation config to follow changes made in vllm-project/vllm#29646
Details:
Where should the reviewer start?
Related Issues: (use one of the action keywords Closes / Fixes / Resolves / Relates to)
Summary by CodeRabbit
✏️ Tip: You can customize this high-level summary in your review settings.