Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

[BugFix][V1] Fix parallel sampling finishing/aborts bug Something isn't working v1
#14512 opened Mar 9, 2025 by njhill Loading…
[Frontend] Fix typo in tool chat templates for llama3.2 and toolace documentation Improvements or additions to documentation
#14501 opened Mar 8, 2025 by bjj Loading…
[Misc] Unify formatter and linter to use ruff ci/build documentation Improvements or additions to documentation misc v1
#14485 opened Mar 8, 2025 by aarnphm Loading…
Fix GuidedDecodingParams backend_name issue ci/build documentation Improvements or additions to documentation frontend multi-modality Related to multi-modality (#4194) structured-output v1
#14473 opened Mar 8, 2025 by sethkimmel3 Loading…
[core][V1] pluggable scheduler v1
#14466 opened Mar 7, 2025 by joerunde Loading…
Fix EAGLE output norm bug documentation Improvements or additions to documentation speculative-decoding
#14464 opened Mar 7, 2025 by luyuzhe111 Loading…
[ROCm][Kernel] MoE weights padding
#14454 opened Mar 7, 2025 by gshtras Loading…
[ROCm] Fix kernel cache miss in Triton FA
#14448 opened Mar 7, 2025 by hyoon1 Loading…
[BUGFIX] fix the need_recv method of model_runner
#14436 opened Mar 7, 2025 by maobaolong Loading…
[Usage] Refactor speculative decoding configuration and tests documentation Improvements or additions to documentation speculative-decoding
#14434 opened Mar 7, 2025 by ShangmingCai Loading…
[Bugfix][Kernel]: Fix AllSpark kernel compilation errors and enable for CUDA < 12.0 ci/build ready ONLY add when PR is ready to merge/full CI is needed
#14430 opened Mar 7, 2025 by wyajieha Loading…
ProTip! Mix and match filters to narrow down what you’re looking for.