[0.13.0][cherry-pick][bugfix] fix the complex and potentially problematic generate_kv_idx. #5955

Merged
yiz-liu merged 1 commit into vllm-project:releases/v0.13.0 from pisceskkk:fix_cp-013
Jan 21, 2026

Conversation

@pisceskkk
Contributor

@pisceskkk pisceskkk commented Jan 16, 2026

What this PR does / why we need it?

In long-sequence scenarios, the chunked-prefill path can hit a dimension mismatch. The problem was observed during precision testing on the code_generate_lite dataset. This PR removes the redundant computation in generate_kv_idx and instead derives the value from existing results with straightforward calculations.
ref: #5957

@pisceskkk pisceskkk force-pushed the fix_cp-013 branch 3 times, most recently from 678932e to 6cd7354 Compare January 19, 2026 01:27
@pisceskkk pisceskkk closed this Jan 19, 2026
@pisceskkk pisceskkk reopened this Jan 19, 2026
@weiguihua2 weiguihua2 added the ready (read for review) and ready-for-test (start test by label for PR) labels Jan 19, 2026
@github-actions
Contributor

This pull request has conflicts, please resolve those before we can evaluate the pull request.

@pisceskkk pisceskkk force-pushed the fix_cp-013 branch 2 times, most recently from db29155 to 48a1e99 Compare January 20, 2026 09:26
Signed-off-by: QiuChunshuo <qiuchunshuo@huawei.com>
@pisceskkk pisceskkk changed the title [0.13.0][bugfix] fix the complex and potentially problematic generate_kv_idx. [0.13.0][cherry-pick][bugfix] fix the complex and potentially problematic generate_kv_idx. Jan 21, 2026
@yiz-liu yiz-liu merged commit 5c40992 into vllm-project:releases/v0.13.0 Jan 21, 2026
12 checks passed
@pisceskkk pisceskkk deleted the fix_cp-013 branch January 21, 2026 08:01
845473182 pushed a commit to 845473182/vllm-ascend that referenced this pull request Jan 22, 2026
…lm-ascend into FIA_v0.13.0

* 'releases/v0.13.0' of https://github.com/vllm-project/vllm-ascend:
  [0.13.0][Doc] Supplement PD separation parameters of DeepSeek V3.1 (vllm-project#6054)
  [EPLB][Bugfix][v0.13.0] Incorporate the warm up of the EPLB into the profile run. (vllm-project#6099)
  [EPLB][Bugfix] Dispatch Allgather use log2phy if enable eplb (vllm-project#5933) (vllm-project#6016)
  [0.13.0][CI]fix for CI lint (vllm-project#6093)
  [0.13.0][cherry-pick][bugfix] fix the complex and potentially problematic generate_kv_idx. (vllm-project#5955)
starmountain1997 pushed a commit to starmountain1997/vllm-ascend that referenced this pull request Jan 31, 2026
…atic generate_kv_idx. (vllm-project#5955)

tangtiangu pushed a commit to tangtiangu/jiusi-vllm-ascend that referenced this pull request Feb 24, 2026
…atic generate_kv_idx. (vllm-project#5955)



3 participants