
Conversation

@LinkerCodeMonkey

We added code to support LLaVA (#307).

Test code:

from vllm import MLLM, SamplingParams

# Example prompts; each prompt is paired with one image below.
prompts = [
    "What is the man doing?",
    "What is your name?",
    "What can I do for you?",
    "What is the man doing?",
]
# One image per prompt, referenced by URL.
images = [{
    "src_type": "url",
    "image_src": "IMAGE_URL"}] * 4

sampling_params = SamplingParams(temperature=0.8, top_p=0.5, max_tokens=1024)
model, tokenizer = "/PATH/LLaVA-13b-delta-v1-1", "/PATH/LLaVA-13b-delta-v1-1"
gpu_memory_utilization = 0.9
mllm = MLLM(model=model, tokenizer=tokenizer,
            gpu_memory_utilization=gpu_memory_utilization)
outputs = mllm.generate(prompts, images, sampling_params)

# Print the outputs.
for output in outputs:
    prompt = output.prompt
    generated_text = output.outputs[0].text
    print(f"Prompt: {prompt!r}, Generated text: {generated_text!r}")

LinkerCodeMonkey changed the title from "add llava support" to "Add LLaVA support" on Aug 17, 2023
zhuohan123 added the new-model (Requests to new models) label on Sep 12, 2023
@teraktor2006

Thanks. Does this work with LLaVA 1.5?

@hmellor (Member) commented Mar 28, 2024

@LinkerCodeMonkey do you still plan to work on this PR?

@WoosukKwon (Collaborator)

Closed as we added support for LLaVA in #3042
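
For anyone landing here later: a minimal sketch of running LLaVA 1.5 through vLLM's documented multimodal interface. The model id, chat template, and field names follow vLLM's published examples and may differ from what #3042 originally shipped; "example.jpg" is a placeholder.

from PIL import Image
from vllm import LLM, SamplingParams

# LLaVA 1.5 checkpoint in Hugging Face format.
llm = LLM(model="llava-hf/llava-1.5-7b-hf")
image = Image.open("example.jpg")  # placeholder image path

outputs = llm.generate(
    {
        # The <image> token marks where the vision embeddings are
        # spliced into the prompt in LLaVA 1.5's chat format.
        "prompt": "USER: <image>\nWhat is the man doing? ASSISTANT:",
        "multi_modal_data": {"image": image},
    },
    SamplingParams(temperature=0.8, top_p=0.5, max_tokens=128),
)
print(outputs[0].outputs[0].text)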

@WoosukKwon WoosukKwon closed this Apr 12, 2024
jikunshang pushed a commit to jikunshang/vllm that referenced this pull request Feb 7, 2025
jikunshang pushed a commit to jikunshang/vllm that referenced this pull request Feb 20, 2025
Support MLA (vllm-project#775)

[Deepseek r1] Improve the latent cache by saving the last 64 dims in the key cache and 512 in the value cache (vllm-project#804)

Before, we could only allocate 1854 blocks with 29.2 GB; now we are able to
allocate 3156 blocks.
Performance-wise, there is no visible regression, and we can push to a higher
batch_size or longer context length.

---------

Signed-off-by: Chendi Xue <[email protected]>
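
A back-of-the-envelope sketch of the block accounting behind a gain like this; the block size, layer count, dtype width, and the "before" layout below are illustrative assumptions, not DeepSeek-R1's actual configuration:

GiB = 1024 ** 3

def num_blocks(free_bytes, block_size, num_layers, key_dims, value_dims,
               dtype_bytes=2):
    # Each block holds `block_size` tokens; every token stores `key_dims`
    # key entries and `value_dims` value entries per layer.
    bytes_per_block = block_size * num_layers * (key_dims + value_dims) * dtype_bytes
    return int(free_bytes) // bytes_per_block

free = 29.2 * GiB
# Assumed "before": all 576 dims (512 latent + 64 rope) duplicated in both caches.
print(num_blocks(free, block_size=128, num_layers=61, key_dims=576, value_dims=576))
# After: only the 64 rope dims in the key cache and the 512 latent dims in
# the value cache, roughly halving the per-token footprint.
print(num_blocks(free, block_size=128, num_layers=61, key_dims=64, value_dims=512))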
jikunshang pushed a commit to jikunshang/vllm that referenced this pull request Mar 14, 2025
amy-why-3459 pushed a commit to amy-why-3459/vllm that referenced this pull request Sep 15, 2025
### What this PR does / why we need it?

This is a continuation of vllm-project#716.
This PR adds a workflow to build and release wheels, and also releases the
source distribution to PyPI.
Three conditions trigger the workflow:

1. a PR to `main` or `*-dev`
2. a push to `main` or `*-dev`
3. a push of a tag named `v*`

The release to PyPI only happens under condition 3. Under conditions 1 and 2,
the workflow generates the .tar.gz, builds the .whl, and uploads them as GitHub
artifacts, but does not publish a release (a sketch of this gating logic
follows the commit message below).

Update:
The .whl will also be built and uploaded to GitHub artifacts by a scheduled task.


### Does this PR introduce _any_ user-facing change?
No.

### How was this patch tested?
All trigger conditions were tested on my fork repo.

---------

Signed-off-by: Shuqiao Li <[email protected]>
Signed-off-by: Yikun Jiang <[email protected]>
Co-authored-by: Yikun Jiang <[email protected]>
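
The trigger gating described above reduces to a small decision rule. A minimal sketch in Python; the function name, event strings, and return values are hypothetical, not the actual workflow variables:

import fnmatch

def workflow_action(event: str, ref: str) -> str:
    # Hypothetical mirror of the workflow's trigger conditions; `event`
    # and `ref` stand in for GitHub's event name and git branch/tag name.
    branch_ok = ref == "main" or fnmatch.fnmatch(ref, "*-dev")
    if event == "push" and ref.startswith("v"):          # condition 3: v* tag
        return "build-and-release-to-pypi"
    if event in ("pull_request", "push") and branch_ok:  # conditions 1 and 2
        return "build-and-upload-artifacts-only"
    return "skip"

assert workflow_action("push", "v0.7.1") == "build-and-release-to-pypi"
assert workflow_action("pull_request", "main") == "build-and-upload-artifacts-only"
assert workflow_action("push", "feature-x") == "skip"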
zhyajie pushed a commit to zhyajie/vllm that referenced this pull request Oct 30, 2025
update dsr1 launch script & add benchmark script
wz1qqx pushed a commit to wz1qqx/vllm that referenced this pull request Dec 29, 2025
* Fix linux builds with cibuildwheel

The build is broken because torch 2.7 does not support CUDA 12.4 and does
not have distributions for manylinux_2_17_x86_64.

This commit updates the manylinux CUDA image used by cibuildwheel to
manylinux_2_28_x86_64, which is supported by torch 2.7.

Signed-off-by: Martin Hickey <[email protected]>

* Remove cibuild configuration duplication

Configuration for cibuildwheel was set in pyproject.toml and also as
environment variables in the GitHub publish workflow.

This commit removes the configuration from the workflow and centralizes
it in pyproject.toml.

Signed-off-by: Martin Hickey <[email protected]>

* Clean more disk space from runner

The runner was running out of disk space when building the
cp312-manylinux_x86_64 wheel; this step frees additional space.

Signed-off-by: Martin Hickey <[email protected]>

* Refactor runner disk cleanup

Signed-off-by: Martin Hickey <[email protected]>

---------

Signed-off-by: Martin Hickey <[email protected]>
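
Centralizing the settings in pyproject.toml works because cibuildwheel reads its configuration from the `[tool.cibuildwheel]` table there. A minimal sketch of inspecting such a setting with Python 3.11+'s tomllib; the key name follows cibuildwheel's documented options, while the file path and fallback value are assumptions:

import tomllib

# Read the manylinux image setting that the commit above centralizes.
with open("pyproject.toml", "rb") as f:
    config = tomllib.load(f)

cibw = config.get("tool", {}).get("cibuildwheel", {})
image = cibw.get("manylinux-x86_64-image", "manylinux_2_28")  # assumed fallback
print(f"manylinux x86_64 image: {image}")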