-
-
Notifications
You must be signed in to change notification settings - Fork 6.1k
Pull requests: vllm-project/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Misc] Replace os environ to monkeypatch in test suite
v1
#14516
opened Mar 9, 2025 by
t-sibiraj
Loading…
[BugFix][V1] Fix parallel sampling finishing/aborts
bug
Something isn't working
v1
#14512
opened Mar 9, 2025 by
njhill
Loading…
[Frontend] Support both tool calling and reasoning parser for reasoni…
frontend
#14511
opened Mar 9, 2025 by
WangErXiao
Loading…
[Misc] QoL: add speculative_model to SpeculativeConfig
speculative-decoding
v1
#14509
opened Mar 9, 2025 by
andylolu2
Loading…
[Frontend] Fix typo in tool chat templates for llama3.2 and toolace
documentation
Improvements or additions to documentation
#14501
opened Mar 8, 2025 by
bjj
Loading…
[Misc] Unify formatter and linter to use ruff
ci/build
documentation
Improvements or additions to documentation
misc
v1
#14485
opened Mar 8, 2025 by
aarnphm
Loading…
LLama 3.2 11b lm eval accuracy drop fix
ci/build
documentation
Improvements or additions to documentation
frontend
needs-rebase
speculative-decoding
structured-output
#14477
opened Mar 8, 2025 by
libinta
Loading…
[Frontend] Pythonic tool names flexibility (#14470)
frontend
#14474
opened Mar 8, 2025 by
bjj
Loading…
Fix GuidedDecodingParams backend_name issue
ci/build
documentation
Improvements or additions to documentation
frontend
multi-modality
Related to multi-modality (#4194)
structured-output
v1
#14473
opened Mar 8, 2025 by
sethkimmel3
Loading…
Fix EAGLE output norm bug
documentation
Improvements or additions to documentation
speculative-decoding
#14464
opened Mar 7, 2025 by
luyuzhe111
Loading…
[INTEL-HPU] Deepseek R1 model enabling for Intel Gaudi
ci/build
#14455
opened Mar 7, 2025 by
xuechendi
Loading…
[Feature]: PD separation supports prefix caching #12257
#14440
opened Mar 7, 2025 by
skyCreateXian
Loading…
[Usage] Refactor speculative decoding configuration and tests
documentation
Improvements or additions to documentation
speculative-decoding
#14434
opened Mar 7, 2025 by
ShangmingCai
Loading…
[Kernel] [V1] Further optimizations to ROCm (Triton) Backend to better handle GQA.
#14431
opened Mar 7, 2025 by
tdoublep
Loading…
[Bugfix][Kernel]: Fix AllSpark kernel compilation errors and enable for CUDA < 12.0
ci/build
ready
ONLY add when PR is ready to merge/full CI is needed
#14430
opened Mar 7, 2025 by
wyajieha
Loading…
Previous Next
ProTip!
Mix and match filters to narrow down what you’re looking for.