Issues: vllm-project/vllm

[Roadmap] vLLM Roadmap Q4 2024
#9006 opened Oct 1, 2024 by simon-mo
Open 26
vLLM's V1 Engine Architecture
#8779 opened Sep 24, 2024 by simon-mo
Open 10

Issues list

[Bug]: IBM Granite 3.1 tool parser fails (label: bug)
#11402 opened Dec 22, 2024 by K-Mistele
1 task done
[RFC]: Flexible Weight Sync for vLLM Workers (label: RFC)
#11399 opened Dec 21, 2024 by ZSL98
1 task done
[Installation]: cannot install vllm with openvino backend (label: installation)
#11398 opened Dec 21, 2024 by yuzisun
1 task done
[Feature]: obtain logits (label: feature request)
#11397 opened Dec 21, 2024 by zhc7
1 task done
[RFC]: Hybrid Memory Allocator (label: RFC)
#11382 opened Dec 20, 2024 by heheda12345
1 task done
[Bug]: Guided decoding crashes for GLM-4 model (label: bug)
#11377 opened Dec 20, 2024 by frankang
1 task done
[Bug]: vLLM crashes on tokenized embedding input (label: bug)
#11375 opened Dec 20, 2024 by FriedrichBethke
1 task done
[Bug]: vllm serve fails when passing --skip-tokenizer-init flag (label: bug)
#11374 opened Dec 20, 2024 by ishitamed19
1 task done
[Bug]: Prefix caching doesn't work for LlavaOneVision (label: bug)
#11371 opened Dec 20, 2024 by sleepwalker2017
1 task done
[Bug]: vllm 0.6.3.post1 crash when deploy qwen2vl 72b (label: bug)
#11356 opened Dec 20, 2024 by xxlight
1 task done
[New Model]: answerdotai/ModernBERT-large (label: new model)
#11347 opened Dec 19, 2024 by pooyadavoodi
1 task
[Performance]: 1P1D Disaggregation performance (label: performance)
#11345 opened Dec 19, 2024 by Jeffwan
1 task done
[Bug]: Paligemma 2 model loading error (label: bug)
#11343 opened Dec 19, 2024 by mmderakhshani
1 task done