[CI]update triton ascend version#5392
Conversation
|
Note Gemini is unable to generate a review for this pull request due to the file types involved not being currently supported. |
|
👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:
If CI fails, you can run linting and testing checks locally according Contributing and Testing. |
777202c to
88f1e5c
Compare
Signed-off-by: Meihan-chen <jcccx.cmh@gmail.com>
Signed-off-by: Meihan-chen <jcccx.cmh@gmail.com>
Signed-off-by: Meihan-chen <jcccx.cmh@gmail.com>
Signed-off-by: Meihan-chen <jcccx.cmh@gmail.com>
Signed-off-by: Meihan-chen <jcccx.cmh@gmail.com>
91e794b to
4a5e98d
Compare
Signed-off-by: Meihan-chen <jcccx.cmh@gmail.com>
4a5e98d to
9d6e913
Compare
Signed-off-by: Meihan-chen <jcccx.cmh@gmail.com>
|
This pull request has conflicts, please resolve those before we can evaluate the pull request. |
| n = cu_num_draft_tokens.numel() | ||
| BLOCK_SIZE = 2 | ||
| grid = triton.cdiv(n, BLOCK_SIZE) | ||
| from triton.runtime import driver # type: ignore |
There was a problem hiding this comment.
reuse get_vectorcore_num() in
There was a problem hiding this comment.
Thanks, I will update this in a new PR to fix the Triton test file bug.
Signed-off-by: Meihan-chen <jcccx.cmh@gmail.com>
|
|
||
| ```bash | ||
| source /usr/local/Ascend/ascend-toolkit/8.3.RC2/bisheng_toolkit/set_env.sh | ||
| BISHENG_NAME="Ascend-BiSheng-toolkit_aarch64_20251225.run" |
There was a problem hiding this comment.
plz use uname -i to get the architecture dynamicly, thus we could satisfy the x86 users also
There was a problem hiding this comment.
thanks, it has been update in $(uname -i)
Signed-off-by: Meihan-chen <jcccx.cmh@gmail.com>
98851d4 to
e56f1ab
Compare
MengqingCao
left a comment
There was a problem hiding this comment.
LGTM, thx for this!
…to FIA_rebase * 'main' of https://github.com/vllm-project/vllm-ascend: (88 commits) [1/N] Refactor nightly test structure (vllm-project#5479) Docs: Remove deprecated --task parameter for embedding models (vllm-project#5257) Revert "moe_gating_top_k" (vllm-project#5512) [Doc] Fix issue link for 0.12.0 (vllm-project#5500) [CI]update triton ascend version (vllm-project#5392) moe_gating_top_k (vllm-project#5271) [refactor] refactor model runner capture model (vllm-project#5230) Update corresponding vllm commit ID to 12 29 (vllm-project#5475) [Kernel]update csrc cmakelist for open-source cann (vllm-project#5458) [OP] add custom op aclnnMoeInitRoutingCustom (vllm-project#5251) [Refactor][EAGLE] 1/N delete __init__ in mtp_proposer (vllm-project#5176) [Refactor][Triton] Move reject sample triton kernels into ops/triton (vllm-project#5324) [Feature] support eager mode in model runner v2 (vllm-project#5210) [feature] fia support sliding windows (vllm-project#5239) Optimize some rejectsampler functions to make npu op launch non-blocking (vllm-project#4587) [Feature] Support to use fullgraph with eagle (vllm-project#5118) [EPLB][refactor] Modification of the initialization logic for expert_map and log2phy(depend on pr5285) (vllm-project#5311) [Refactor]6/N Extract common code of class AscendMLAImpl (vllm-project#5314) [Refactor] cache cos/sin in mla & remove parameter model in builder. (vllm-project#5277) update vllm pin to 12.27 (vllm-project#5412) ...
### What this PR does / why we need it? update triton-ascend version to 1229 and bisheng version in 1225; - vLLM version: release/v0.13.0 - vLLM main: vllm-project/vllm@254f6b9 --------- Signed-off-by: Meihan-chen <jcccx.cmh@gmail.com>
### What this PR does / why we need it? update triton-ascend version to 1229 and bisheng version in 1225; - vLLM version: release/v0.13.0 - vLLM main: vllm-project/vllm@254f6b9 --------- Signed-off-by: Meihan-chen <jcccx.cmh@gmail.com> Signed-off-by: zrj026 <zhangrunjiang026@gmail.com>
### What this PR does / why we need it? update triton-ascend version to 1229 and bisheng version in 1225; - vLLM version: release/v0.13.0 - vLLM main: vllm-project/vllm@254f6b9 --------- Signed-off-by: Meihan-chen <jcccx.cmh@gmail.com>
### What this PR does / why we need it? update triton-ascend version to 1229 and bisheng version in 1225; - vLLM version: release/v0.13.0 - vLLM main: vllm-project/vllm@254f6b9 --------- Signed-off-by: Meihan-chen <jcccx.cmh@gmail.com> Signed-off-by: zrj026 <zhangrunjiang026@gmail.com>
What this PR does / why we need it?
update triton-ascend version to 1229 and bisheng version in 1225;
Does this PR introduce any user-facing change?
How was this patch tested?