[Compressed Tensors] Allow configs with non-explicit ignores by kylesayrs · Pull Request #41965 · vllm-project/vllm

kylesayrs · 2026-05-07T14:11:44Z

Purpose

Allow for greater config flexibility by not requiring all layers to be listed in the ignore list
- Any layers which do not match schemes are assumed to be ignored

Previously:

"quantization_config": {
  "config_groups": {
    "group_0": {
      "targets": [
        "re:.*attn.*_proj$",
      ],
    }
  },
  "ignore": [
    "layers.2.attn.indexer.weights_proj",
    "layers.4.attn.indexer.weights_proj",
    "layers.6.attn.indexer.weights_proj",
    "layers.8.attn.indexer.weights_proj",
    "layers.10.attn.indexer.weights_proj",
  ],
},

Now:

"quantization_config": {
  "config_groups": {
    "group_0": {
      "targets": [
        "re:.*attn.*_proj$",
      ],
    }
  },
  "ignore": [],
},

Changes

Allow find_matched_target to return None, and treat a non-match against schemes as an ignored layer

Testing

Added test_find_matched_target_returns_none_on_no_match
Added test_get_scheme_dict_returns_none_on_no_match

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>

claude

Claude Code Review

This pull request is from a fork — automated review is disabled. A repository maintainer can comment @claude review to run a one-time review.

gemini-code-assist

Code Review

This pull request refactors find_matched_target to return None instead of raising a ValueError when a target is not found, updating the calling logic in compressed_tensors.py to handle these cases explicitly. Feedback was provided regarding a potential AttributeError in get_scheme_dict if the retrieved scheme dictionary is None, along with a suggestion to avoid in-place modifications of the configuration dictionary.

mgoin

LGTM Kyle, thanks for cleaning up the ValueError. Can you add a unit test to solidify this expected behavior?

dsikka

LGTM but we should probably add a basic smoke test for this.

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>

…oject#41965) Signed-off-by: Kyle Sayers <kylesayrs@gmail.com> Signed-off-by: Libin Tang <libin.tang@intel.com>

allow non-explicit ignores

079a651

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>

kylesayrs requested review from mgoin, pavanimajety, robertgshaw2-redhat, tlrmchlsmth and yewentao256 as code owners May 7, 2026 14:11

claude Bot reviewed May 7, 2026

View reviewed changes

gemini-code-assist Bot reviewed May 7, 2026

View reviewed changes

Comment thread vllm/model_executor/layers/quantization/compressed_tensors/compressed_tensors.py

mgoin approved these changes May 7, 2026

View reviewed changes

dsikka approved these changes May 7, 2026

View reviewed changes

kylesayrs added 2 commits May 7, 2026 11:57

add test

68bad70

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>

fix ai test

1c2e89e

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>

mgoin added ready ONLY add when PR is ready to merge/full CI is needed quantization labels May 7, 2026

vllm-bot merged commit c1819ca into vllm-project:main May 7, 2026
66 of 71 checks passed

kylesayrs deleted the kylesayrs/ct-non-explicit-ignore branch May 7, 2026 21:53

libinta pushed a commit to libinta/vllm that referenced this pull request May 8, 2026

[Compressed Tensors] Allow configs with non-explicit ignores (vllm-pr…

561e026

…oject#41965) Signed-off-by: Kyle Sayers <kylesayrs@gmail.com> Signed-off-by: Libin Tang <libin.tang@intel.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Compressed Tensors] Allow configs with non-explicit ignores#41965

[Compressed Tensors] Allow configs with non-explicit ignores#41965
vllm-bot merged 3 commits intovllm-project:mainfrom
neuralmagic:kylesayrs/ct-non-explicit-ignore

kylesayrs commented May 7, 2026 •

edited

Loading

Uh oh!

claude Bot left a comment

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

Uh oh!

mgoin left a comment

Uh oh!

dsikka left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

Conversation

kylesayrs commented May 7, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Purpose

Changes

Testing

Uh oh!

claude Bot left a comment

Choose a reason for hiding this comment

Claude Code Review

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

mgoin left a comment

Choose a reason for hiding this comment

Uh oh!

dsikka left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

kylesayrs commented May 7, 2026 •

edited

Loading