[CI] Add DeepSeek-V3.2 large EP nightly ci #6378
👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:
If CI fails, you can run the linting and testing checks locally according to the Contributing and Testing guides.
Code Review
This pull request adds a new nightly CI test for DeepSeek-V3.2. The changes include a new YAML configuration file for the test and modifications to the CI script.
My review found a few issues:
- The new YAML configuration file contains duplicated arguments in the `server_cmd` blocks, which could lead to unexpected behavior. I've suggested removing them.
- The `run.sh` script has been modified to fetch and checkout a hardcoded pull request. This is a critical issue for a CI script and should be removed before merging.
Overall, the changes are straightforward, but the issues identified need to be addressed to ensure the CI works correctly and is maintainable.
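For reference, duplicated arguments of the kind flagged above can be caught mechanically before a config is merged. The following is a minimal sketch, not part of this PR: the helper name `find_duplicate_flags` is hypothetical, and it assumes each `server_cmd` is available as a single-line shell-style string.

```python
import shlex
from collections import Counter


def find_duplicate_flags(server_cmd: str) -> list[str]:
    """Return long-form flags (e.g. --port) that appear more than once.

    Hypothetical helper for illustration; it is not part of the PR.
    """
    tokens = shlex.split(server_cmd)
    # Normalize "--flag=value" to "--flag" so both spellings count as one flag.
    flags = [tok.split("=", 1)[0] for tok in tokens if tok.startswith("--")]
    counts = Counter(flags)
    return sorted(flag for flag, n in counts.items() if n > 1)


if __name__ == "__main__":
    # Illustrative command only; the model name and values are placeholders.
    cmd = "vllm serve some/model --port 8000 --max-model-len 4096 --port 8000"
    print(find_duplicate_flags(cmd))
```

A check like this only detects repetition; which occurrence the server honors depends on its argument parser, which is why removing the duplicate is the safer fix.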
This pull request has conflicts, please resolve those before we can evaluate the pull request.
Signed-off-by: hfadzxy <starmoon_zhang@163.com>
…to qwen3next_graph

* 'main' of https://github.com/vllm-project/vllm-ascend: (40 commits)
  [Feature] Add docs of batch invariance and make some extra operators patch (vllm-project#6910)
  [bugfix]Qwen2.5VL accurate question (vllm-project#6975)
  [CI] Add DeepSeek-V3.2 large EP nightly ci (vllm-project#6378)
  [Ops][BugFix] Fix RoPE shape mismatch for mtp models with flashcomm v1 enabled (vllm-project#6939)
  [bugfix]fix file not found error in nightly of single-node (vllm-project#6976)
  [Bugfix] Fix the acceptance rates dorp issue when applying eagle3 to QuaRot model (vllm-project#6914)
  [CI] Enable auto upgrade e2e estimated time for auto-partition suites (vllm-project#6840)
  [Doc][Misc] Fix msprobe_guide.md documentation issues (vllm-project#6965)
  [Nightly][Refactor]Migrate nightly single-node model tests from `.py` to `.yaml` (vllm-project#6503)
  [BugFix] Improve GDN layer detection for multimodal models (vllm-project#6941)
  [feat]ds3.2 pcp support mtp and chunkprefill (vllm-project#6917)
  [CPU binding] Implement global CPU slicing and improve IRQ binding for Ascend NPUs (vllm-project#6945)
  [Triton] Centralize Ascend extension op dispatch in triton_utils (vllm-project#6937)
  [csrc][bugfix] Add compile-time Ascend950/910_95 compatibility for custom ops between CANN8.5 and 9.0 (vllm-project#6936)
  [300I][Bugfix] fix unquant model weight nd2nz error (vllm-project#6851)
  [doc] fix supported_models (vllm-project#6930)
  [CI] nightly test timeout (vllm-project#6912)
  [CI] Upgrade CANN to 8.5.1 (vllm-project#6897)
  [Model]Add Qwen3-Omni quantization Ascend NPU adaptation and optimization (vllm-project#6828)
  [P/D][v0.16.0]Adapt to RecomputeScheduler in vLLM 0.16.0 (vllm-project#6898)
  ...
### What this PR does / why we need it?
Add DeepSeek-V3.2 nightly ci
Fix PD routing to exclude headless nodes when collecting prefiller/decoder IPs

- vLLM version: v0.14.1
- vLLM main: vllm-project/vllm@dc917cc

Signed-off-by: hfadzxy <starmoon_zhang@163.com>
What this PR does / why we need it?
Add DeepSeek-V3.2 nightly ci
Fix PD routing to exclude headless nodes when collecting prefiller/decoder IPs
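The routing fix described above can be pictured roughly as follows. This is a minimal sketch under stated assumptions, not the code from this PR: the `Node` record, its `role` and `headless` fields, and `collect_routable_ips` are all hypothetical names, and it assumes headless nodes do not accept requests directly and so should not be handed to the router.

```python
from dataclasses import dataclass


@dataclass
class Node:
    # All fields are hypothetical; the PR's real data model may differ.
    ip: str
    role: str       # "prefiller" or "decoder"
    headless: bool  # assumed: True if the node does not serve requests itself


def collect_routable_ips(nodes: list[Node]) -> tuple[list[str], list[str]]:
    """Return (prefiller_ips, decoder_ips), skipping headless nodes."""
    prefillers = [n.ip for n in nodes if n.role == "prefiller" and not n.headless]
    decoders = [n.ip for n in nodes if n.role == "decoder" and not n.headless]
    return prefillers, decoders


if __name__ == "__main__":
    # Placeholder addresses for illustration only.
    cluster = [
        Node("10.0.0.1", "prefiller", headless=False),
        Node("10.0.0.2", "prefiller", headless=True),
        Node("10.0.0.3", "decoder", headless=False),
        Node("10.0.0.4", "decoder", headless=True),
    ]
    print(collect_routable_ips(cluster))
```

Filtering at collection time keeps the router configuration free of endpoints that would never answer, rather than relying on the router to detect and skip them.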
Does this PR introduce any user-facing change?
How was this patch tested?