Skip to content

[Quantization] enable compressed-tensors marlin support for turing#31000

Merged
vllm-bot merged 1 commit intovllm-project:mainfrom
jinzhen-lin:patch-4
Dec 19, 2025
Merged

[Quantization] enable compressed-tensors marlin support for turing#31000
vllm-bot merged 1 commit intovllm-project:mainfrom
jinzhen-lin:patch-4

Conversation

@jinzhen-lin
Copy link
Copy Markdown
Contributor

@jinzhen-lin jinzhen-lin commented Dec 19, 2025

The Marlin support for turing have beed added in #29901

This PR enable it in compressed-tensors.

Signed-off-by: Jinzhen Lin <jinzhen.ljz@antgroup.com>
Copy link
Copy Markdown
Contributor

@gemini-code-assist gemini-code-assist bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request aims to enable Marlin support for Turing GPUs by lowering the minimum required compute capability from 8.0 to 7.5. While the change in MarlinLinearKernel.get_min_capability is correct, it is incomplete. A related check in vllm/model_executor/layers/quantization/utils/marlin_utils.py also needs to be updated to reflect the new minimum capability. Without this additional change, Marlin will remain disabled on Turing GPUs. I've left a critical comment with the details of the required fix.

@vllm-bot vllm-bot merged commit de08b8f into vllm-project:main Dec 19, 2025
8 checks passed
yugong333 pushed a commit to yugong333/vllm that referenced this pull request Dec 22, 2025
Majid-Taheri pushed a commit to Majid-Taheri/vllm that referenced this pull request Dec 23, 2025
…llm-project#31000)

Signed-off-by: Jinzhen Lin <jinzhen.ljz@antgroup.com>
Signed-off-by: Ubuntu <mjtaheri68@gmail.com>
dsuhinin pushed a commit to dsuhinin/vllm that referenced this pull request Jan 21, 2026
…llm-project#31000)

Signed-off-by: Jinzhen Lin <jinzhen.ljz@antgroup.com>
Signed-off-by: dsuhinin <suhinin.dmitriy@gmail.com>
ItzDEXX pushed a commit to ItzDEXX/vllm that referenced this pull request Feb 19, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants