Pull requests: vllm-project/llm-compressor

Pull requests list

[AWQ][Qwen3 VL] Add qwen3-vl-30b-a3b-Instruct-example (labels: awq, qwen, ready)
#1947 opened Oct 18, 2025 by JartX
Fix Qwen3 VL MoE Example (labels: qwen, ready)
#1946 opened Oct 18, 2025 by dsikka
Update ReadMe to link to overview (labels: ready)
#1944 opened Oct 17, 2025 by dsikka
AWQ per-channel fix (labels: awq)
#1942 opened Oct 17, 2025 by zhanglei1172
[MXFP4] Support
#1938 opened Oct 15, 2025 by dsikka (Draft)
AI Fix for: Create AWQ guide for llm-docs
#1932 opened Oct 14, 2025 by shanaya-Gupta
[Attention] Support FP4 attention quantization (labels: nvfp4)
#1924 opened Oct 14, 2025 by kylesayrs
[Transforms] Use get_head_dim util
#1918 opened Oct 12, 2025 by kylesayrs
[Training] Fix tokenizer attribute of SessionMixin (labels: ready)
#1895 opened Oct 1, 2025 by kylesayrs
add gpt oss nvfp4 example
#1885 opened Sep 30, 2025 by shanjiaz (Draft)
Add awq activation fp8 support in loss compute
#1873 opened Sep 27, 2025 by Bluedyson
[Dependencies] update lm_eval version pin (labels: ready)
#1862 opened Sep 24, 2025 by brian-dellabetta
[Logging] clean up CompressionLogger verbosity (labels: ready)
#1861 opened Sep 23, 2025 by brian-dellabetta
MSE observer for NVFP4
#1840 opened Sep 17, 2025 by shubhra
ready label check (labels: ready)
#1832 opened Sep 17, 2025 by brian-dellabetta
1 task done
add support for per-head attention quantization
#1791 opened Sep 2, 2025 by eldarkurtic