Pull requests: vllm-project/llm-compressor
[Examples] Correct out-of-date warning for kv cache examples
ready
When a PR is ready for review
#2209
opened Jan 9, 2026 by
kylesayrs
Add batch token statistics logging to LengthAwareSampler
ready
#2204
opened Jan 8, 2026 by
jwpark33
[Bugfix] Fix data_collator default docstring
ready
#2197
opened Jan 7, 2026 by
kylesayrs
Switch the pytorch tests to run on the L4 runners
ready
#2195
opened Jan 7, 2026 by
dhuangnm
[AWQ] speed improvements
awq
For any issue / PR related to AWQ support
enhancement
New feature or request
ready
fix: suppress tokenizer parallelism warning in oneshot
#2183
opened Jan 4, 2026 by
majiayu000
Refactor gpt oss quantization use all expert quantization
#2164
opened Dec 21, 2025 by
saraswatmks
[Tracing] Dispatch after tracing
ready
#2146
opened Dec 17, 2025 by
kylesayrs
[Args] Shuffle data samples by default
ready
#2144
opened Dec 17, 2025 by
kylesayrs
[Bugfix] Improve pipeline inference
ready
#2131
opened Dec 15, 2025 by
kylesayrs
[Examples] QwenOmni Example
ready
#2125
opened Dec 14, 2025 by
kylesayrs
[Misc] Better debugging and guards to autowrapping
ready
#2124
opened Dec 14, 2025 by
kylesayrs
feat: add importance-aware mixed-precision quantization
#2083
opened Dec 2, 2025 by
wangwenmingaa