OOM when using InternVL2_5-1B-MPO #3143
Comments
Can you share the following information?
I get the error when I run the code below.
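Judging from the call the maintainer quotes back in the next reply, the code was presumably along these lines (the model path and engine settings below are taken from that quote, not verbatim from this comment):

```python
# Presumed reproduction: load the 4-bit AWQ model with the TurboMind engine.
# Reconstructed from the snippet quoted in the next reply.
from lmdeploy import pipeline, TurbomindEngineConfig

pipe = pipeline(
    "./InternVL2_5-1B-MPO-4bit/",
    backend_config=TurbomindEngineConfig(model_format="awq", session_len=2048),
)
```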
Can you enable the INFO log level? Let's check what the log indicates.

```python
from lmdeploy import pipeline, TurbomindEngineConfig, PytorchEngineConfig

pipe = pipeline(
    "./InternVL2_5-1B-MPO-4bit/",
    backend_config=TurbomindEngineConfig(model_format="awq", session_len=2048),
    log_level='INFO',
)
```
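With log_level='INFO', lmdeploy should print engine-initialization details such as weight loading and memory allocation, which helps narrow down where the OOM happens.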
What's the memory size of the Jetson Orin?
The GPU memory size is 16 GB, and I have uploaded the log file: log.txt
I followed the installation guide to build lmdeploy (0.7.0.post3) from source.
Inference using the PyTorch engine works fine.
However, after quantizing the model to 4-bit with AWQ, I encountered an OOM error when loading the model with the TurboMind engine.
I tried setting session_len=2048 in TurbomindEngineConfig.
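For reference, a common knob on memory-constrained boards like the 16 GB Jetson Orin (where CPU and GPU share the same memory) is cache_max_entry_count, the fraction of free GPU memory TurboMind reserves for the KV cache. A minimal sketch, assuming the same model path as above and an illustrative value of 0.2; this is not a fix confirmed in this thread:

```python
# Sketch: reduce TurboMind's KV-cache reservation alongside a short session.
# cache_max_entry_count is the fraction of free GPU memory set aside for the
# KV cache (lmdeploy's default is 0.8); 0.2 here is an illustrative value.
from lmdeploy import pipeline, TurbomindEngineConfig

pipe = pipeline(
    "./InternVL2_5-1B-MPO-4bit/",
    backend_config=TurbomindEngineConfig(
        model_format="awq",
        session_len=2048,
        cache_max_entry_count=0.2,
    ),
    log_level='INFO',
)
```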