Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
35 commits
Select commit Hold shift + click to select a range
af94ce4
[refactor] refactor excute_model and _dymmy_run method
kunpengW-code Jan 20, 2026
df95ddf
Merge branch 'main' of https://github.com/vllm-project/vllm-ascend in…
kunpengW-code Jan 20, 2026
8ce11c1
[refactor] refactor excute_model and _dymmy_run method
kunpengW-code Jan 20, 2026
39e16ee
[refactor] refactor excute_model and _dymmy_run method
kunpengW-code Jan 20, 2026
f068d67
[Bugfix] Fix lint & add ec_connector_output for EPD
gcanlin Jan 20, 2026
0d15960
Merge pull request #1 from gcanlin/lint-ec
kunpengW-code Jan 21, 2026
b4bac2f
fix ci
kunpengW-code Jan 21, 2026
a7499e9
Merge branch 'main' of https://github.com/vllm-project/vllm-ascend in…
kunpengW-code Jan 21, 2026
fbde812
fix ci
kunpengW-code Jan 21, 2026
d466779
fix ci
kunpengW-code Jan 21, 2026
5551075
fix ci
kunpengW-code Jan 21, 2026
e3112ba
fix ci
kunpengW-code Jan 23, 2026
16002b7
Merge branch 'main' of https://github.com/vllm-project/vllm-ascend in…
kunpengW-code Jan 23, 2026
2a5b2da
fix ci
kunpengW-code Jan 23, 2026
cd09771
fix ci
kunpengW-code Jan 24, 2026
a348382
fix ci
kunpengW-code Jan 25, 2026
bdfc21b
Merge branch 'main' of https://github.com/vllm-project/vllm-ascend in…
kunpengW-code Jan 25, 2026
6bd5af1
fix ci
kunpengW-code Jan 25, 2026
f2ebdc9
fixci
zhenwenqi2024 Jan 26, 2026
6e9d6df
fixci
zhenwenqi2024 Jan 26, 2026
bd60de2
fixci
zhenwenqi2024 Jan 26, 2026
7985f03
fixci
zhenwenqi2024 Jan 26, 2026
cef3736
fixci
zhenwenqi2024 Jan 26, 2026
6627fb2
fixci
zhenwenqi2024 Jan 26, 2026
b5237f0
fixci
zhenwenqi2024 Jan 26, 2026
6b591d5
fixci
zhenwenqi2024 Jan 26, 2026
2200265
fixci
zhenwenqi2024 Jan 26, 2026
f2b749c
fixci
zhenwenqi2024 Jan 26, 2026
01e635b
fixci
zhenwenqi2024 Jan 27, 2026
42f1a2f
fixci
zhenwenqi2024 Jan 27, 2026
61cb587
fixci
zhenwenqi2024 Jan 27, 2026
adfdf8d
fixci
zhenwenqi2024 Jan 27, 2026
428c206
fixci
zhenwenqi2024 Jan 27, 2026
b6e89b9
fixci
zhenwenqi2024 Jan 27, 2026
240b814
fixci
zhenwenqi2024 Jan 27, 2026
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
1 change: 1 addition & 0 deletions tests/ut/worker/test_pcp_manager.py
Original file line number Diff line number Diff line change
Expand Up @@ -123,6 +123,7 @@ def test_update_tokens_for_pcp_basic(tokens, num_reqs, num_computed_tokens,
vllm_config = MagicMock()
vllm_config.model_config = MagicMock()
vllm_config.speculative_config.num_speculative_tokens = 0
vllm_config.scheduler_config.max_num_seqs = 1000

pcp_manager = PCPManager(pcp_world_size=pcp_size,
pcp_rank=0,
Expand Down
2 changes: 1 addition & 1 deletion vllm_ascend/attention/sfa_v1.py
Original file line number Diff line number Diff line change
Expand Up @@ -170,7 +170,7 @@ def __init__(
npu_fused_infer_attention_score TND layout's limit of 16, \
got {self.decode_threshold}"
)

self.reorder_batch_threshold = self.decode_threshold
self.attn_mask_builder = AttentionMaskBuilder(self.device)
self.rope_dim = self.model_config.hf_text_config.qk_rope_head_dim
self.enable_dsa_cp = enable_dsa_cp()
Expand Down
Loading