-
-
Notifications
You must be signed in to change notification settings - Fork 5k
Pull requests: vllm-project/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Kernel][Triton][AMD] Change default block size for triton_scaled_mm to 128 for 3-5x speedup
#11698
opened Jan 3, 2025 by
rasmith
Loading…
[Hardware][Apple] MacOs installation setup
ci/build
documentation
Improvements or additions to documentation
#11696
opened Jan 2, 2025 by
wallashss
Loading…
[V1] Add BlockTable class
ready
ONLY add when PR is ready to merge/full CI is needed
#11693
opened Jan 2, 2025 by
WoosukKwon
Loading…
Add split_special_tokens to the Tokenize Endpoint
frontend
#11691
opened Jan 2, 2025 by
ruediste
Loading…
k8s-config: Update the secret to use stringData
documentation
Improvements or additions to documentation
#11679
opened Jan 2, 2025 by
surajssd
Loading…
[torch.compile] Hide KV cache behind torch.compile boundary
#11677
opened Jan 2, 2025 by
heheda12345
•
Draft
[Bugfix][SpecDecode] Adjust Eagle model architecture to align with intended design
#11672
opened Jan 1, 2025 by
llsj14
Loading…
[CI/Build] Update OpenVINO Dockerfile to Ubuntu 24.04
ci/build
#11670
opened Jan 1, 2025 by
ruediste
Loading…
[V1] Simplify Shutdown
frontend
ready
ONLY add when PR is ready to merge/full CI is needed
#11659
opened Dec 31, 2024 by
robertgshaw2-neuralmagic
Loading…
[XPU] Make pp group initilized for pipeline-parallelism
#11648
opened Dec 31, 2024 by
ys950902
Loading…
[Doc] [1/N] Reorganize Getting Started section
documentation
Improvements or additions to documentation
#11645
opened Dec 31, 2024 by
DarkLight1337
Loading…
[Docs] reorganize sponsorship page
documentation
Improvements or additions to documentation
#11639
opened Dec 30, 2024 by
simon-mo
Loading…
[Quantization/Parameter] WIP: Replace parameter subclasses with raw nn.Parameter with additional attributes
#11622
opened Dec 30, 2024 by
cennn
Loading…
[torch.compile] consider relevant code in compilation cache
#11614
opened Dec 30, 2024 by
youkaichao
Loading…
[Do Not Merge] - LoRA V1 Reference PR
needs-rebase
#11613
opened Dec 30, 2024 by
varun-sundar-rabindranath
•
Draft
Previous Next
ProTip!
Follow long discussions with comments:>50.