[Model][VLM] Support Bee-8B Model by uyzhang · Pull Request #27012 · vllm-project/vllm

uyzhang · 2025-10-16T10:20:52Z

Purpose

Example Serving Command

vllm serve \
    Open-Bee/Bee-8B-RL \
    --served-model-name bee-8b-rl \
    --tensor-parallel-size 8 \
    --gpu-memory-utilization 0.8 \
    --host 0.0.0.0 \
    --port 8000 \
    --trust-remote-code

Example Offline Inference

import os
from transformers import AutoProcessor
from vllm import LLM, SamplingParams
from PIL import Image
import requests
from io import BytesIO


def load_image(image_path):
    """Load image from URL or local path"""
    if image_path.startswith(('http://', 'https://')):
        response = requests.get(image_path, timeout=10)
        response.raise_for_status()
        image = Image.open(BytesIO(response.content))
    else:
        image = Image.open(image_path)

    # Convert RGBA to RGB if needed
    if image.mode == "RGBA":
        background = Image.new('RGB', image.size, (255, 255, 255))
        background.paste(image, mask=image.split()[-1])
        image = background

    return image.convert("RGB")


def main():

    model_path = "Open-Bee/Bee-8B-RL"

    llm = LLM(
        model=model_path,
        limit_mm_per_prompt={"image": 5},
        trust_remote_code=True,
        tensor_parallel_size=1,
        gpu_memory_utilization=0.8,
    )

    sampling_params = SamplingParams(
        temperature=0.8,
        max_tokens=16384,
    )

    image_url = "http://images.cocodataset.org/val2017/000000039769.jpg"
    image = load_image(image_url)
    text = "Describe this image."

    messages = [
        {
            "role":
            "user",
            "content": [
                {
                    "type": "image",
                    "image": image
                },
                {
                    "type": "text",
                    "text": text
                },
            ],
        },
    ]

    processor = AutoProcessor.from_pretrained(model_path,
                                              trust_remote_code=True)
    prompt = processor.apply_chat_template(
        messages,
        tokenize=False,
        add_generation_prompt=True,
    )

    mm_data = {"image": image}
    llm_inputs = {
        "prompt": prompt,
        "multi_modal_data": mm_data,
    }

    outputs = llm.generate([llm_inputs], sampling_params=sampling_params)
    generated_text = outputs[0].outputs[0].text

    print(generated_text)


if __name__ == '__main__':
    main()

Signed-off-by: uyzhang <yi.zhang.4096@gmail.com>

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Signed-off-by: Yi Zhang <zhangyi970819@gmail.com> Signed-off-by: uyzhang <yi.zhang.4096@gmail.com>

Signed-off-by: Yi Zhang <zhangyi970819@gmail.com> Signed-off-by: uyzhang <yi.zhang.4096@gmail.com>

Signed-off-by: uyzhang <yi.zhang.4096@gmail.com>

mergify · 2025-10-16T10:21:28Z

Documentation preview: https://vllm--27012.org.readthedocs.build/en/27012/

gemini-code-assist

Code Review

This pull request adds support for the Bee-8B model. The implementation correctly integrates the model into vLLM by inheriting from existing LLaVA-like model classes and providing a model-specific multimodal projector and processing logic. The changes also include documentation updates, examples, and tests. I've found one critical compatibility issue in the implementation that needs to be addressed.

vllm/model_executor/models/bee.py

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Signed-off-by: Yi Zhang <zhangyi970819@gmail.com>

ywang96

Thanks for your contribution! LGTM

Signed-off-by: uyzhang <yi.zhang.4096@gmail.com> Signed-off-by: Yi Zhang <zhangyi970819@gmail.com> Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Co-authored-by: Roger Wang <hey@rogerw.io>

Signed-off-by: uyzhang <yi.zhang.4096@gmail.com> Signed-off-by: Yi Zhang <zhangyi970819@gmail.com> Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Co-authored-by: Roger Wang <hey@rogerw.io> Signed-off-by: 0xrushi <6279035+0xrushi@users.noreply.github.com>

Signed-off-by: uyzhang <yi.zhang.4096@gmail.com> Signed-off-by: Yi Zhang <zhangyi970819@gmail.com> Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Co-authored-by: Roger Wang <hey@rogerw.io>

uyzhang and others added 6 commits October 16, 2025 18:17

[Model][VLM] Support Bee-8B Model

e6ef04a

Signed-off-by: uyzhang <yi.zhang.4096@gmail.com>

Update docs/models/supported_models.md

6da2705

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Signed-off-by: Yi Zhang <zhangyi970819@gmail.com> Signed-off-by: uyzhang <yi.zhang.4096@gmail.com>

Update test_common.py

033b3f8

Signed-off-by: Yi Zhang <zhangyi970819@gmail.com> Signed-off-by: uyzhang <yi.zhang.4096@gmail.com>

Update bee.py

9a040b7

Signed-off-by: Yi Zhang <zhangyi970819@gmail.com> Signed-off-by: uyzhang <yi.zhang.4096@gmail.com>

Update bee.py

260c635

Signed-off-by: Yi Zhang <zhangyi970819@gmail.com> Signed-off-by: uyzhang <yi.zhang.4096@gmail.com>

update

0e766eb

Signed-off-by: uyzhang <yi.zhang.4096@gmail.com>

uyzhang requested review from DarkLight1337 and ywang96 as code owners October 16, 2025 10:20

mergify bot added documentation Improvements or additions to documentation multi-modality Related to multi-modality (#4194) new-model Requests to new models labels Oct 16, 2025

mergify bot mentioned this pull request Oct 16, 2025

[Model][VLM] Support Bee-8B Model #27001

Closed

gemini-code-assist bot reviewed Oct 16, 2025

View reviewed changes

vllm/model_executor/models/bee.py Outdated Show resolved Hide resolved

uyzhang and others added 2 commits October 16, 2025 18:24

Update vllm/model_executor/models/bee.py

425d1f1

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Signed-off-by: Yi Zhang <zhangyi970819@gmail.com>

Merge branch 'main' into main

891b2fc

ywang96 approved these changes Oct 17, 2025

View reviewed changes

Merge branch 'main' into main

9b957f2

ywang96 added the ready ONLY add when PR is ready to merge/full CI is needed label Oct 17, 2025

ywang96 enabled auto-merge (squash) October 17, 2025 07:20

DarkLight1337 added this to the v0.11.1 milestone Oct 17, 2025

Merge branch 'main' into main

6f400df

ywang96 disabled auto-merge October 19, 2025 21:55

ywang96 enabled auto-merge (squash) October 19, 2025 21:55

ywang96 merged commit f32bf75 into vllm-project:main Oct 20, 2025
55 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Model][VLM] Support Bee-8B Model#27012

[Model][VLM] Support Bee-8B Model#27012
ywang96 merged 10 commits intovllm-project:mainfrom
uyzhang:main

uyzhang commented Oct 16, 2025 •

edited by github-actions bot

Loading

Uh oh!

mergify bot commented Oct 16, 2025

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

ywang96 left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Conversation

uyzhang commented Oct 16, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Purpose

Example Serving Command

Example Offline Inference

Uh oh!

mergify bot commented Oct 16, 2025

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

ywang96 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

uyzhang commented Oct 16, 2025 •

edited by github-actions bot

Loading