Tfh buildfaster 0310 test by tfhddd · Pull Request #6 · ascend-gha-runners/vllm-ascend

tfhddd · 2026-03-10T12:51:38Z

What this PR does / why we need it?

Does this PR introduce any user-facing change?

How was this patch tested?

vLLM version: v0.16.0
vLLM main: vllm-project/vllm@4034c3d

vllm-project#6873)

This reverts commit 3953dcf to keep the basic functions available.

---------

Signed-off-by: wangli <wangli858794774@gmail.com>

…ject#7045)

### What this PR does / why we need it?

This PR adds comprehensive documentation for the CPU binding feature on Ascend NPUs. It includes:

- A detailed developer guide (`docs/source/developer_guide/feature_guide/cpu_binding.md`) covering the design, internal logic, allocation examples, and troubleshooting for the CPU binding mechanism.
- A concise user guide (`docs/source/user_guide/feature_guide/cpu_binding.md`) explaining the core concepts, usage, and common issues for end-users.
- An update to `additional_config.md` to use consistent terminology for binding strategies (`global-slicing` and `topo-affinity`).

This documentation is needed to help both developers and users understand, use, and debug the CPU binding feature, which is critical for performance on ARM+Ascend platforms.

### Does this PR introduce _any_ user-facing change?

No. This is a documentation-only update.

### How was this patch tested?

The documentation has been reviewed for clarity and technical accuracy. The examples and descriptions align with the implementation in `vllm_ascend/cpu_binding.py`.

- vLLM version: v0.16.0
- vLLM main: vllm-project/vllm@4034c3d

---------

Signed-off-by: chenchuw886 <chenchuw@huawei.com>
Signed-off-by: c00818886 <chenchuwei@huawei.com>
Co-authored-by: chenchuw886 <chenchuw@huawei.com>

### What this PR does / why we need it?

- New quantization method: introduces support for the W8A8SC static linear quantization scheme specifically for 310P hardware, enabling more efficient model compression.
- Refactors `save_sharded_state_310.py` to avoid a multi-process issue.

### Does this PR introduce _any_ user-facing change?

No

### How was this patch tested?

W8A8SC quant E2E test.

- vLLM version: v0.16.0
- vLLM main: vllm-project/vllm@4034c3d

---------

Signed-off-by: pu-zhe <zpuaa@outlook.com>

)

### What this PR does / why we need it?

1. Increase nightly multi-node test max-parallel from 1 to 2, and fix resource conflicts that arise when tests run concurrently.
2. Fix the parse-trigger job: add an `if` condition so it only runs on schedule, workflow_dispatch, or PRs labeled nightly-test.
3. Adjust the nightly schedule: shift the trigger time from 24:00 to 23:45 (UTC+8).

### Does this PR introduce _any_ user-facing change?

### How was this patch tested?

- vLLM version: v0.16.0
- vLLM main: vllm-project/vllm@4034c3d

---------

Signed-off-by: hfadzxy <starmoon_zhang@163.com>
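The schedule shift in this commit message can be checked with a little arithmetic. GitHub Actions evaluates `schedule` cron expressions in UTC, so a UTC+8 wall-clock time has to be converted before it goes into a workflow. The sketch below is only an illustration: the helper name `local_time_to_utc_cron` is hypothetical and does not come from the repository, and the actual cron line used by the workflow is not shown in this PR.

```python
from datetime import datetime, timedelta, timezone


def local_time_to_utc_cron(hour: int, minute: int, utc_offset_hours: int) -> str:
    """Return a daily cron expression (in UTC) for a local wall-clock time.

    Hypothetical helper for illustration; not part of vllm-ascend.
    """
    local_tz = timezone(timedelta(hours=utc_offset_hours))
    # The calendar date is arbitrary; only the time of day matters for a daily job.
    local = datetime(2026, 3, 10, hour, minute, tzinfo=local_tz)
    utc = local.astimezone(timezone.utc)
    return f"{utc.minute} {utc.hour} * * *"


# Old trigger: 24:00 (i.e. 00:00) UTC+8; new trigger: 23:45 UTC+8.
print(local_time_to_utc_cron(0, 0, 8))    # 0 16 * * *
print(local_time_to_utc_cron(23, 45, 8))  # 45 15 * * *
```

Under these assumptions, moving the trigger to 23:45 (UTC+8) corresponds to a UTC cron time of 15:45.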

### What this PR does / why we need it?

Fix the acceptance and high-concurrency bugs that occur when eagle3 and cp are enabled.

### Does this PR introduce _any_ user-facing change?

No

### How was this patch tested?

Tests and UT.

- vLLM version: v0.16.0
- vLLM main: vllm-project/vllm@4034c3d

---------

Signed-off-by: lilinsiman <lilinsiman@gmail.com>

github-actions · 2026-03-10T12:51:56Z

👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:

- A PR should do only one thing; smaller PRs enable faster reviews.
- Every PR should include unit tests and end-to-end tests to ensure it works and is not broken by future PRs.
- Write the commit message by filling in the PR description to help reviewers and future developers understand the change.

If CI fails, you can run linting and testing checks locally according to Contributing and Testing.


### What this PR does / why we need it?

| File Path |
| :--- |
| `tests/ut/batch_invariant/test_batch_invariant.py` |
| `tests/ut/compilation/test_acl_graph.py` |
| `tests/ut/compilation/test_npugraph_ex_utils_check.py` |
| `tests/ut/conftest.py` |
| `tests/ut/core/test_scheduler_dynamic_batch.py` |
| `tests/ut/device_allocator/test_camem.py` |
| `tests/ut/device_allocator/test_cpu_binding.py` |
| `tests/ut/distributed/device_communicators/test_pyhccl.py` |
| `tests/ut/distributed/device_communicators/test_pyhccl_wrapper.py` |
| `tests/ut/distributed/mooncake/test_config_data.py` |
| `tests/ut/distributed/test_communicator.py` |
| `tests/ut/distributed/test_parallel_state.py` |
| `tests/ut/eplb/adaptor/test_vllm_adaptor.py` |
| `tests/ut/eplb/core/policy/test_policy_abstract.py` |
| `tests/ut/eplb/core/policy/test_policy_default_eplb.py` |
| `tests/ut/eplb/core/policy/test_policy_factor.py` |
| `tests/ut/eplb/core/policy/test_policy_swift_balancer.py` |
| `tests/ut/eplb/core/test_eplb_device_transfer_loader.py` |
| `tests/ut/eplb/core/test_eplb_utils.py` |
| `tests/ut/spec_decode/test_eagle_proposer.py` |
| `tests/ut/test_ascend_config.py` |
| `tests/ut/test_envs.py` |
| `tests/ut/test_platform.py` |
| `tests/ut/test_utils.py` |
| `tests/ut/worker/test_block_table.py` |
| `tests/ut/worker/test_pcp_manager.py` |
| `tests/ut/worker/test_worker_v1.py` |

### Does this PR introduce _any_ user-facing change?

### How was this patch tested?

- vLLM version: v0.16.0
- vLLM main: vllm-project/vllm@4034c3d

Signed-off-by: MrZ20 <2609716663@qq.com>
Signed-off-by: tfhddd <2272751277@qq.com>

Potabk and others added 7 commits March 10, 2026 11:27

Revert "[Feature][Quant] Auto-detect quantization format from model f… (

33234aa

vllm-project#6873)

This reverts commit 3953dcf to keep the basic functions available.

---------

Signed-off-by: wangli <wangli858794774@gmail.com>

feat: [CI] Introduce uv to accelerate pip install

7427cf3

feat: [CI] Introduce uv to accelerate pip install

2beee95

github-actions Bot added the documentation, ci/build, module:tests, module:quantization, and module:core labels Mar 10, 2026

tfhddd added 6 commits March 10, 2026 21:06

feat: [CI] Introduce uv to accelerate pip install

ab5533e

feat: [CI] Introduce uv to accelerate pip install

bc4923d

feat: [CI] Introduce uv to accelerate pip install

7fdbba6

feat: [CI] Introduce uv to accelerate pip install

bb878fd

feat: [CI] Introduce uv to accelerate pip install

93f034b

feat: [CI] Introduce uv to accelerate pip install

67c09b1

tfhddd closed this by deleting the head repository Apr 16, 2026

tfhddd wants to merge 13 commits into ascend-gha-runners:main from tfhddd:tfh_buildfaster_0310_test


6 participants
