
bugfix: llava-hf/llava-interleave-qwen-7b-hf (#2497) #2657

Merged 1 commit into InternLM:main on Oct 28, 2024

Conversation

deepindeed2022 (Contributor)

Motivation

Fixes issue #2497. Running the following command raises an AttributeError:

python3 -m lmdeploy.serve.openai.api_server path/to/llava_hf/llava-interleave-qwen-7b-hf

Traceback (most recent call last):
  File "/usr/lib/python3.8/runpy.py", line 194, in _run_module_as_main
    return _run_code(code, main_globals, None,
  File "/usr/lib/python3.8/runpy.py", line 87, in _run_code
    exec(code, run_globals)
  File "/data/willow/Repo/lmdeploy/lmdeploy/serve/openai/api_server.py", line 1376, in <module>
    fire.Fire(serve)
  File "/opt/py38/lib/python3.8/site-packages/fire/core.py", line 143, in Fire
    component_trace = _Fire(component, args, parsed_flag_args, context, name)
  File "/opt/py38/lib/python3.8/site-packages/fire/core.py", line 477, in _Fire
    component, remaining_args = _CallAndUpdateTrace(
  File "/opt/py38/lib/python3.8/site-packages/fire/core.py", line 693, in _CallAndUpdateTrace
    component = fn(*varargs, **kwargs)
  File "/data/willow/Repo/lmdeploy/lmdeploy/serve/openai/api_server.py", line 1333, in serve
    VariableInterface.async_engine = pipeline_class(
  File "/data/willow/Repo/lmdeploy/lmdeploy/serve/vl_async_engine.py", line 21, in __init__
    self.vl_encoder = ImageEncoder(model_path,
  File "/data/willow/Repo/lmdeploy/lmdeploy/vl/engine.py", line 85, in __init__
    self.model = load_vl_model(model_path, backend_config=backend_config)
  File "/data/willow/Repo/lmdeploy/lmdeploy/vl/model/builder.py", line 56, in load_vl_model
    return module(**kwargs)
  File "/data/willow/Repo/lmdeploy/lmdeploy/vl/model/base.py", line 31, in __init__
    self.build_model()
  File "/data/willow/Repo/lmdeploy/lmdeploy/vl/model/llava_hf.py", line 37, in build_model
    load_checkpoint_and_dispatch(
  File "/opt/py38/lib/python3.8/site-packages/accelerate/big_modeling.py", line 604, in load_checkpoint_and_dispatch
    device_map = infer_auto_device_map(
  File "/opt/py38/lib/python3.8/site-packages/accelerate/utils/modeling.py", line 1240, in infer_auto_device_map
    if check_tied_parameters_in_config(model) and len(tied_parameters) == 0:
  File "/opt/py38/lib/python3.8/site-packages/accelerate/utils/modeling.py", line 574, in check_tied_parameters_in_config
    and model.get_output_embeddings()
  File "/opt/py38/lib/python3.8/site-packages/transformers/models/llava/modeling_llava.py", line 260, in get_output_embeddings
    return self.language_model.get_output_embeddings()
AttributeError: 'NoneType' object has no attribute 'get_output_embeddings'

Modification

  • Fix the AttributeError raised during model initialization
  • Add a --vision-max-batch-size option to the openai/api_server start config, following lmdeploy/cli/serve.py

BC-breaking (Optional)

Does the modification introduce changes that break the backward-compatibility of the downstream repositories?
If so, please describe how it breaks the compatibility and how the downstream projects should modify their code to keep compatibility with this PR.

Use cases (Optional)

We have tested llava_hf/llava-interleave-qwen-7b-hf, and the vision encoder can now be configured from the start command, for example:
python3 -m lmdeploy.serve.openai.api_server path/to/llava_hf/llava-interleave-qwen-7b-hf --vision-max-batch-size 16
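
Once the server is up, it can be queried through its OpenAI-compatible chat completions endpoint. A minimal client sketch, assuming the default port 23333 and that the served model name equals the model path (the /v1/models endpoint lists the actual name); the image URL is only a placeholder:

from openai import OpenAI

# Point the client at the local api_server instance (port assumed to be the default).
client = OpenAI(base_url="http://0.0.0.0:23333/v1", api_key="none")

response = client.chat.completions.create(
    model="path/to/llava_hf/llava-interleave-qwen-7b-hf",  # served model name, assumed
    messages=[{
        "role": "user",
        "content": [
            {"type": "text", "text": "Describe this image."},
            {"type": "image_url", "image_url": {"url": "https://example.com/demo.jpg"}},
        ],
    }],
)
print(response.choices[0].message.content)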

@@ -1054,13 +1054,15 @@ def serve(model_path: str,

_, pipeline_class = get_task(model_path)

vision_config = VisionConfig(kwargs.get("vision_max_batch_size", 1))
deepindeed2022 (Contributor, Author) replied:
Thanks, I will use the option to set max_batch_size.
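
For reference, a minimal sketch of passing the option through to the vision encoder config. VisionConfig is lmdeploy's existing config class (the diff above constructs it from kwargs); the explicit vision_max_batch_size variable shown here is illustrative rather than the merged code:

from lmdeploy import VisionConfig

# Value that would arrive from the --vision-max-batch-size start option.
vision_max_batch_size = 16
# Build the vision encoder config from the explicit option instead of kwargs.get(...).
vision_config = VisionConfig(max_batch_size=vision_max_batch_size)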

irexyc (Collaborator) commented on Oct 25, 2024:

I suggest just setting model.config.tie_word_embeddings to False before using load_checkpoint_and_dispatch.

- fix init raise exception because tie_word_embeddings config
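
A minimal self-contained sketch of that suggestion and of why it works. It mimics the stripped vision-only model from the traceback (setting language_model to None is an emulation, not necessarily how lmdeploy drops it) and assumes a transformers version contemporary with this PR, where language_model is a direct submodule:

from accelerate import init_empty_weights
from accelerate.utils.modeling import check_tied_parameters_in_config
from transformers import LlavaConfig, LlavaForConditionalGeneration

# A small default config stands in for
# AutoConfig.from_pretrained("llava-hf/llava-interleave-qwen-7b-hf").
config = LlavaConfig()

with init_empty_weights():
    model = LlavaForConditionalGeneration(config)
    # Emulate the vision-only encoder: the language model is gone, which is why
    # model.get_output_embeddings() hits 'NoneType' in the traceback above.
    model.language_model = None

# The suggested fix: with tie_word_embeddings disabled, accelerate's
# check_tied_parameters_in_config (reached from load_checkpoint_and_dispatch via
# infer_auto_device_map) short-circuits before touching get_output_embeddings().
model.config.tie_word_embeddings = False
check_tied_parameters_in_config(model)  # no longer raises the AttributeError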
AllentDan (Collaborator) left a comment:

LGTM

lvhan028 (Collaborator) commented:
@deepindeed2022 please resolve the linting error

pip install pre-commit
cd lmdeploy # the root directory of the repo
pre-commit install
pre-commit run --all-files

lvhan028 merged commit 39de575 into InternLM:main on Oct 28, 2024
4 of 5 checks passed
AllentDan pushed a commit to AllentDan/lmdeploy that referenced this pull request Nov 13, 2024
- fix init raise exception because tie_word_embeddings config