[Quant] SupportsQuant handles ignored_modules#14635
[Quant] SupportsQuant handles ignored_modules#14635kylesayrs wants to merge 4 commits intovllm-project:mainfrom
Conversation
|
👋 Hi! Thank you for contributing to the vLLM project. 💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels. Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging. To run CI, PR reviewers can either: Add 🚀 |
Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>
…uant-ignored-modules
Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>
|
This pull request has merge conflicts that must be resolved before it can be |
|
Closing in favor of #20046 The above solution is a little bit better, as it can be implemented more incrementally and also is capable of updating more than just the ignored list |
Purpose
Changes
SupportsQuantnow modifies thequant_config.ignored_modulesattribute to account for the relevanthf_to_vllm_mapperignored_modulesattribute instead ofllm_int8_skip_modulesignored_modulesattribute instead ofmodules_to_not_convertSupportsQuantto qwen2_5_vlTesting