Skip to content

[Quantization] enable compressed-tensors marlin support for turing (2)#31008

Merged
Isotr0py merged 3 commits intovllm-project:mainfrom
jinzhen-lin:patch-5
Dec 19, 2025
Merged

[Quantization] enable compressed-tensors marlin support for turing (2)#31008
Isotr0py merged 3 commits intovllm-project:mainfrom
jinzhen-lin:patch-5

Conversation

@jinzhen-lin
Copy link
Copy Markdown
Contributor

@jinzhen-lin jinzhen-lin commented Dec 19, 2025

In the previous PR #31000 , I tried to apply the suggestions from gemini-code-assist, but I realized it is committed to a new branch. Sorry.

cc @mgoin

Signed-off-by: Jinzhen Lin <jinzhen.ljz@antgroup.com>
Copy link
Copy Markdown
Contributor

@gemini-code-assist gemini-code-assist bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request lowers the minimum required device capability for Marlin from compute capability 8.0 to 7.5, enabling support for NVIDIA Turing GPUs. This change aligns the Python-level checks with the C++ kernel implementations, which already contain support for Turing architecture. The modification is correct and well-contained. I have no further comments.

@jinzhen-lin jinzhen-lin marked this pull request as draft December 19, 2025 04:50
@jinzhen-lin jinzhen-lin marked this pull request as ready for review December 19, 2025 04:52
@Isotr0py Isotr0py enabled auto-merge (squash) December 19, 2025 06:51
@github-actions github-actions bot added the ready ONLY add when PR is ready to merge/full CI is needed label Dec 19, 2025
@Isotr0py Isotr0py merged commit 9187de9 into vllm-project:main Dec 19, 2025
54 checks passed
yugong333 pushed a commit to yugong333/vllm that referenced this pull request Dec 22, 2025
Majid-Taheri pushed a commit to Majid-Taheri/vllm that referenced this pull request Dec 23, 2025
vllm-project#31008)

Signed-off-by: Jinzhen Lin <jinzhen.ljz@antgroup.com>
Signed-off-by: Ubuntu <mjtaheri68@gmail.com>
dsuhinin pushed a commit to dsuhinin/vllm that referenced this pull request Jan 21, 2026
vllm-project#31008)

Signed-off-by: Jinzhen Lin <jinzhen.ljz@antgroup.com>
Signed-off-by: dsuhinin <suhinin.dmitriy@gmail.com>
ItzDEXX pushed a commit to ItzDEXX/vllm that referenced this pull request Feb 19, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ready ONLY add when PR is ready to merge/full CI is needed

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants