small adjustment to wvSplitKrc #34410

Merged
gshtras merged 2 commits into vllm-project:main from amd-hhashemi:wvSplitKrc5
Feb 12, 2026
@amd-hhashemi
Contributor

@amd-hhashemi amd-hhashemi commented Feb 12, 2026

In some vLLM Docker images, gpt-oss prompts fail without this change. Test results are unchanged.


Signed-off-by: Hashem Hashemi <hashem.hashemi@amd.com>
@mergify mergify bot added the rocm (Related to AMD ROCm) label Feb 12, 2026
@github-project-automation github-project-automation bot moved this to Todo in AMD Feb 12, 2026
Contributor

@gemini-code-assist gemini-code-assist bot left a comment

Code Review

This pull request correctly fixes a critical bug in the wvSplitKrc_ kernel. By clamping the kOffcp offset, it prevents a potential out-of-bounds memory access on the A tensor that was causing runtime failures. The change is correct and improves the kernel's robustness. No further issues were found.
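
For readers without the diff at hand, the clamping idea described in the review can be sketched on the host side as below. This is only an illustration under stated assumptions, not the kernel code from this PR: the diff is not shown in this conversation, and `clamp_chunk_offset`, `load_chunk`, `k_off`, and `width` are hypothetical names (only `kOffcp` and the A tensor are mentioned in the review). The sketch assumes the kernel reads fixed-width chunks along K and that an unclamped offset could start a chunk past the end of a row.

```cpp
#include <algorithm>
#include <cstddef>
#include <vector>

// Hypothetical helper: clamp the start of a fixed-width chunk so that
// [offset, offset + width) stays inside a row of length k.
// If the row is shorter than one chunk, the only safe start is 0.
inline std::size_t clamp_chunk_offset(std::size_t k_off, std::size_t width,
                                      std::size_t k) {
    if (k <= width) {
        return 0;
    }
    return std::min(k_off, k - width);
}

// Hypothetical loader: copies up to `width` elements of `row`, starting at the
// clamped offset, so the read never goes past the end of the row.
inline std::vector<float> load_chunk(const std::vector<float>& row,
                                     std::size_t k_off, std::size_t width) {
    const std::size_t start = clamp_chunk_offset(k_off, width, row.size());
    const std::size_t count = std::min(width, row.size() - start);
    return std::vector<float>(row.begin() + start, row.begin() + start + count);
}
```

With a clamped start, the last chunk may overlap the previous one. A real kernel would presumably still have to mask or otherwise ignore the overlapping elements when accumulating; that part is outside this sketch.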

@gshtras
Collaborator

gshtras commented Feb 12, 2026

Confirming that this fixes the case where a single-prompt request to /completions on a server without AITER outputs "!!!!!".

@gshtras gshtras enabled auto-merge (squash) February 12, 2026 15:56
@github-actions github-actions bot added the ready (ONLY add when PR is ready to merge/full CI is needed) label Feb 12, 2026
@gshtras gshtras merged commit fac4e96 into vllm-project:main Feb 12, 2026
16 of 17 checks passed
@github-project-automation github-project-automation bot moved this from Todo to Done in AMD Feb 12, 2026
gshtras pushed a commit to ROCm/vllm that referenced this pull request Feb 16, 2026
Signed-off-by: Hashem Hashemi <hashem.hashemi@amd.com>
gshtras pushed a commit to ROCm/vllm that referenced this pull request Feb 17, 2026
Signed-off-by: Hashem Hashemi <hashem.hashemi@amd.com>
eldarkurtic pushed a commit to eldarkurtic/vllm that referenced this pull request Feb 19, 2026
Signed-off-by: Hashem Hashemi <hashem.hashemi@amd.com>
Signed-off-by: Eldar Kurtic <research@neuralmagic.com>
gshtras pushed a commit to ROCm/vllm that referenced this pull request Feb 19, 2026
Signed-off-by: Hashem Hashemi <hashem.hashemi@amd.com>
gshtras pushed a commit to ROCm/vllm that referenced this pull request Feb 20, 2026
Signed-off-by: Hashem Hashemi <hashem.hashemi@amd.com>
gshtras pushed a commit to ROCm/vllm that referenced this pull request Feb 23, 2026
Signed-off-by: Hashem Hashemi <hashem.hashemi@amd.com>
llsj14 pushed a commit to llsj14/vllm that referenced this pull request Mar 1, 2026
Signed-off-by: Hashem Hashemi <hashem.hashemi@amd.com>
tunglinwood pushed a commit to tunglinwood/vllm that referenced this pull request Mar 4, 2026
Signed-off-by: Hashem Hashemi <hashem.hashemi@amd.com>
Labels

ready ONLY add when PR is ready to merge/full CI is needed rocm Related to AMD ROCm

Projects

Status: Done
