Skip to content

[Bugfix] Fix numel() downcast in fused_layernorm_dynamic_per_token_quant.cu#17316

Merged
vllm-bot merged 1 commit intovllm-project:mainfrom
r-barnes:export-D73608869
Apr 29, 2025
Merged

[Bugfix] Fix numel() downcast in fused_layernorm_dynamic_per_token_quant.cu#17316
vllm-bot merged 1 commit intovllm-project:mainfrom
r-barnes:export-D73608869

Conversation

@r-barnes
Copy link
Contributor

Summary: numel() has type int64_t. The implicit downcasts fix in this change artificially truncate data ranges which can lead to hard-to-debug errors and SEVs. Using auto ensures that the correct data type is used.

Reviewed By: dtolnay

Differential Revision: D73608869

…layernorm_dynamic_per_token_quant.cu +1

Summary: `numel()` has type `int64_t`. The implicit downcasts fix in this change artificially truncate data ranges which can lead to hard-to-debug errors and SEVs. Using `auto` ensures that the correct data type is used.

Reviewed By: dtolnay

Differential Revision: D73608869
@github-actions
Copy link

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which starts running only a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on Buildkite UI (linked in the PR checks section) and unblock them. If you do not have permission to unblock, ping simon-mo or khluu to add you in our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either: Add ready label to the PR or enable auto-merge.

🚀

@facebook-github-bot
Copy link

This pull request was exported from Phabricator. Differential Revision: D73608869

@tlrmchlsmth tlrmchlsmth added the ready ONLY add when PR is ready to merge/full CI is needed label Apr 28, 2025
@tlrmchlsmth
Copy link
Member

LGTM, thank you!

@tlrmchlsmth tlrmchlsmth enabled auto-merge (squash) April 28, 2025 17:20
@tlrmchlsmth tlrmchlsmth changed the title Fix numel() downcast in vllm/csrc/quantization/fused_kernels/fused_layernorm_dynamic_per_token_quant.cu +1 [Bugfix] Fix numel() downcast in fused_layernorm_dynamic_per_token_quant.cu Apr 28, 2025
@vllm-bot vllm-bot merged commit d6da8a8 into vllm-project:main Apr 29, 2025
89 of 92 checks passed
lk-chen pushed a commit to lk-chen/vllm that referenced this pull request Apr 29, 2025
RichardoMrMu pushed a commit to RichardoMrMu/vllm that referenced this pull request May 12, 2025
…quant.cu (vllm-project#17316)

Signed-off-by: Mu Huai <tianbowen.tbw@antgroup.com>
zzzyq pushed a commit to zzzyq/vllm that referenced this pull request May 24, 2025
…quant.cu (vllm-project#17316)

Signed-off-by: Yuqi Zhang <yuqizhang@google.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ready ONLY add when PR is ready to merge/full CI is needed

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants