[WIP] Support Multimodal Input for Qwen2.5 Omni#57
Closed
tzhouam wants to merge 25 commits intovllm-project:mainfrom
Closed
[WIP] Support Multimodal Input for Qwen2.5 Omni#57tzhouam wants to merge 25 commits intovllm-project:mainfrom
tzhouam wants to merge 25 commits intovllm-project:mainfrom
Conversation
…Added detailed logging for prompt loading failures, file writing issues, and stage shutdown processes. Introduced a new logging utility for orchestrator metrics and streamlined stats handling in the PipelinedOmniLLM class.
…ded support for loading prompts from a .pt file, introduced new command-line arguments for initialization and output handling, and improved error handling for prompt loading. Removed deprecated files related to previous implementations.
Signed-off-by: tzhouam <tzhouam@connect.ust.hk>
Signed-off-by: tzhouam <tzhouam@connect.ust.hk>
Signed-off-by: tzhouam <tzhouam@connect.ust.hk>
…r loading prompts from a .pt file. Signed-off-by: tzhouam <tzhouam@connect.ust.hk>
…r loading prompts from a .pt file. Signed-off-by: tzhouam <tzhouam@connect.ust.hk>
…pport for loading prompts from a .pt file." This reverts commit c18fc40.
…e and support for loading prompts from a .pt file."" This reverts commit 5f3f53a.
…houam/vllm-omni into feat/multi-request-stream-new
…houam/vllm-omni into feat/multi-request-stream-new
Signed-off-by: tzhouam <tzhouam@connect.ust.hk>
Signed-off-by: tzhouam <tzhouam@connect.ust.hk>
Signed-off-by: tzhouam <tzhouam@connect.ust.hk>
Signed-off-by: tzhouam <tzhouam@connect.ust.hk>
Signed-off-by: tzhouam <tzhouam@connect.ust.hk>
Signed-off-by: tzhouam <tzhouam@connect.ust.hk>
Signed-off-by: tzhouam <tzhouam@connect.ust.hk>
Signed-off-by: tzhouam <tzhouam@connect.ust.hk>
…pdating stage configuration to remove unnecessary process flags. This simplifies the YAML configuration for the Qwen2.5 Omni model. Signed-off-by: tzhouam <tzhouam@connect.ust.hk>
5 tasks
Collaborator
|
Closed as its extended version PR #76 has been merged. |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Purpose
This PR is to support the multimodal input for Qwen2.5 Omni (Qwen3 Omni should be similar), where only support one request with audio and should be done later.
Test Plan
Get into the example folder
cd examples/offline_inference/qwen_2_5_omniModify in the cmd below as your path of vllm_omni. Then run.
Test Result
Essential Elements of an Effective PR Description Checklist
supported_models.mdandexamplesfor a new model.BEFORE SUBMITTING, PLEASE READ https://github.com/hsliuustc0106/vllm-omni/blob/main/CONTRIBUTING.md (anything written below this line will be removed by GitHub Actions)