[0.13.0][Doc] Supplement PD separation parameters of DeepSeek V3.1#6054
Merged
wangxiyuan merged 1 commit intovllm-project:releases/v0.13.0from Jan 22, 2026
Merged
Conversation
Contributor
There was a problem hiding this comment.
Code Review
This pull request updates the documentation for DeepSeek V3.1, providing new recommended parameters for a Prefill-Decode (PD) separation deployment. The changes include adjusting speculative decoding settings, GPU memory utilization, and compilation configurations, as well as introducing the VLLM_ASCEND_ENABLE_FLASHCOMM1 environment variable for communication optimization. A new 'Notice' section has been added to explain the core parameters, which greatly improves the clarity of the documentation. The changes are consistent across the different node configurations and appear to be well-considered. I have no concerns with this update.
c33b0ae to
a8c589f
Compare
Signed-off-by: chenmenglong <chenmenglong1@huawei.com>
a8c589f to
cf93cb9
Compare
845473182
pushed a commit
to 845473182/vllm-ascend
that referenced
this pull request
Jan 22, 2026
…lm-ascend into FIA_v0.13.0 * 'releases/v0.13.0' of https://github.com/vllm-project/vllm-ascend: [0.13.0][Doc] Supplement PD separation parameters of DeepSeek V3.1 (vllm-project#6054) [EPLB][Bugfix][v0.13.0] Incorporate the warm up of the EPLB into the profile run. (vllm-project#6099) [EPLB][Bugfix] Dispatch Allgather use log2phy if enable eplb (vllm-project#5933) (vllm-project#6016) [0.13.0][CI]fix for CI lint (vllm-project#6093) [0.13.0][cherry-pick][bugfix] fix the complex and potentially problematic generate_kv_idx. (vllm-project#5955)
starmountain1997
pushed a commit
to starmountain1997/vllm-ascend
that referenced
this pull request
Jan 31, 2026
…llm-project#6054) ### What this PR does / why we need it? Supplement PD separation parameters of DeepSeek V3.1 The recommended parameter configuration for DeepSeek V3.1 in the EP32 scenario after PD separation has been adjusted, and the core parameters have been described in detail. Signed-off-by: chenmenglong <chenmenglong1@huawei.com>
tangtiangu
pushed a commit
to tangtiangu/jiusi-vllm-ascend
that referenced
this pull request
Feb 24, 2026
…llm-project#6054) ### What this PR does / why we need it? Supplement PD separation parameters of DeepSeek V3.1 The recommended parameter configuration for DeepSeek V3.1 in the EP32 scenario after PD separation has been adjusted, and the core parameters have been described in detail. Signed-off-by: chenmenglong <chenmenglong1@huawei.com>
tangtiangu
pushed a commit
to tangtiangu/jiusi-vllm-ascend
that referenced
this pull request
Feb 24, 2026
…llm-project#6054) ### What this PR does / why we need it? Supplement PD separation parameters of DeepSeek V3.1 The recommended parameter configuration for DeepSeek V3.1 in the EP32 scenario after PD separation has been adjusted, and the core parameters have been described in detail. Signed-off-by: chenmenglong <chenmenglong1@huawei.com>
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
What this PR does / why we need it?
Supplement PD separation parameters of DeepSeek V3.1
The recommended parameter configuration for DeepSeek V3.1 in the EP32 scenario after PD separation has been adjusted, and the core parameters have been described in detail.
Does this PR introduce any user-facing change?
How was this patch tested?