[Docs][Model] Support Qwen3-VL-Embedding & Qwen3-VL-Reranker by gcanlin · Pull Request #6034 · vllm-project/vllm-ascend

gcanlin · 2026-01-20T05:16:16Z

What this PR does / why we need it?

Add docs for Qwen3-VL-Embedding & Qwen3-VL-Reranker.

Does this PR introduce any user-facing change?

How was this patch tested?

vLLM version: v0.13.0
vLLM main: vllm-project/vllm@2c24bc6

Signed-off-by: gcanlin <canlinguosdu@gmail.com>

gemini-code-assist

Code Review

This pull request adds documentation for the Qwen3-VL-Embedding and Qwen3-VL-Reranker models. The changes are good and provide useful examples for users. My review includes a few suggestions to improve the correctness of code examples and fix broken links to provide a better user experience. Specifically, I've pointed out a prompt formatting issue, a placeholder path that needs clarification, a character that breaks a URL, and incorrect links to the benchmark documentation.

github-actions · 2026-01-20T05:21:06Z

👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:‌‌

A PR should do only one thing, smaller PRs enable faster reviews.
Every PR should include unit tests and end-to-end tests ‌to ensure it works and is not broken by other future PRs.
Write the commit message by fulfilling the PR description to help reviewer and future developers understand.

If CI fails, you can run linting and testing checks locally according Contributing and Testing.

Signed-off-by: gcanlin <canlinguosdu@gmail.com>

gcanlin · 2026-01-20T05:28:04Z

@wangxiyuan @Yikun This PR is ready. Could you please take a look? More and more users are asking requests for these models.

Signed-off-by: gcanlin <canlinguosdu@gmail.com>

…to FIA_rebase * 'main' of https://github.com/vllm-project/vllm-ascend: (24 commits) add dispath_ffn_combine_bf16 (vllm-project#5866) [BugFix] Fix input parameter bug of dispatch_gmm_combine_decode[RFC: issue 5476] (vllm-project#5932) [1/N][Feat] Xlite Qwen3 MoE Support (vllm-project#5951) [Bugfix] Fix setting of `speculative_config.enforce_eager` for dsv32 (vllm-project#5945) [bugfix][mm] change get_num_encoder_tokens to get_num_encoder_embeds in recompute_schedule.py (vllm-project#5132) [Bugfix] fix pcp qwen full graph FIA bug (vllm-project#6037) [Bugfix]Fixed precision issues caused by pooled request pooling (vllm-project#6049) 【main】【bugfix】Resolved memory deallocation failure in the pooling layer under re-computation workloads. (vllm-project#6045) [main][Bugfix] Fixed an problem related to embeddings sharing (vllm-project#5967) [Feature]refactor the npugraph_ex config, support online-infer with static kernel (vllm-project#5775) [CI][Lint] Show lint diff on failure (vllm-project#5956) [CI] Add wait logic for each individual case (vllm-project#6036) [CI] Add DeepSeek-V3.2-W8A8 nightly ci test (vllm-project#4633) model runner v2 support triton of penalty (vllm-project#5854) [Docs][Model] Support Qwen3-VL-Embedding & Qwen3-VL-Reranker (vllm-project#6034) [Tests] move qwen3 performance test from nightly to e2e (vllm-project#5980) [Bugfix] fix bug of pcp+mtp+async scheduler (vllm-project#5994) [Main2Main] Upgrade vllm commit to releases/v0.14.0 (vllm-project#5988) [Ops] Add layernorm for qwen3Next (vllm-project#5765) [Doc] Add layer_sharding additional config for DeepSeek-V3.2-W8A8 (vllm-project#5921) ...

…oject#6034) ### What this PR does / why we need it? Add docs for Qwen3-VL-Embedding & Qwen3-VL-Reranker. - vLLM version: v0.13.0 - vLLM main: vllm-project/vllm@2c24bc6 --------- Signed-off-by: gcanlin <canlinguosdu@gmail.com>

…oject#6034) ### What this PR does / why we need it? Add docs for Qwen3-VL-Embedding & Qwen3-VL-Reranker. - vLLM version: v0.13.0 - vLLM main: vllm-project/vllm@2c24bc6 --------- Signed-off-by: gcanlin <canlinguosdu@gmail.com> Signed-off-by: zrj026 <zhangrunjiang026@gmail.com>

…oject#6034) ### What this PR does / why we need it? Add docs for Qwen3-VL-Embedding & Qwen3-VL-Reranker. - vLLM version: v0.13.0 - vLLM main: vllm-project/vllm@2c24bc6 --------- Signed-off-by: gcanlin <canlinguosdu@gmail.com>

…oject#6034) ### What this PR does / why we need it? Add docs for Qwen3-VL-Embedding & Qwen3-VL-Reranker. - vLLM version: v0.13.0 - vLLM main: vllm-project/vllm@2c24bc6 --------- Signed-off-by: gcanlin <canlinguosdu@gmail.com> Signed-off-by: zrj026 <zhangrunjiang026@gmail.com>

…oject#6034) ### What this PR does / why we need it? Add docs for Qwen3-VL-Embedding & Qwen3-VL-Reranker. - vLLM version: v0.13.0 - vLLM main: vllm-project/vllm@2c24bc6 --------- Signed-off-by: gcanlin <canlinguosdu@gmail.com>

[Docs][Model] Support Qwen3-VL-Embedding & Qwen3-VL-Reranker

95388dc

Signed-off-by: gcanlin <canlinguosdu@gmail.com>

gcanlin requested review from LCAIZJ, Yikun and wangxiyuan as code owners January 20, 2026 05:16

gemini-code-assist bot reviewed Jan 20, 2026

View reviewed changes

github-actions bot added the documentation Improvements or additions to documentation label Jan 20, 2026

gcanlin added 2 commits January 20, 2026 05:22

fix

829177a

Signed-off-by: gcanlin <canlinguosdu@gmail.com>

add template for reranker

632011a

Signed-off-by: gcanlin <canlinguosdu@gmail.com>

gcanlin mentioned this pull request Jan 20, 2026

[Model][Docs] Support Qwen3-VL-Embedding & Qwen3-VL-Reranker #5741

Closed

gcanlin added 2 commits January 20, 2026 06:42

fix lint

49ee97b

Signed-off-by: gcanlin <canlinguosdu@gmail.com>

update

b8f99d4

Signed-off-by: gcanlin <canlinguosdu@gmail.com>

wangxiyuan merged commit afabb49 into vllm-project:main Jan 20, 2026
10 checks passed

shen-shanshan mentioned this pull request Jan 20, 2026

[RFC]: Multi-Modal Tasks #3508

Open

28 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Docs][Model] Support Qwen3-VL-Embedding & Qwen3-VL-Reranker#6034

[Docs][Model] Support Qwen3-VL-Embedding & Qwen3-VL-Reranker#6034
wangxiyuan merged 5 commits intovllm-project:mainfrom
gcanlin:docs-vl

gcanlin commented Jan 20, 2026 •

edited by github-actions bot

Loading

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

github-actions bot commented Jan 20, 2026

Uh oh!

gcanlin commented Jan 20, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

gcanlin commented Jan 20, 2026 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What this PR does / why we need it?

Does this PR introduce any user-facing change?

How was this patch tested?

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

github-actions bot commented Jan 20, 2026

Uh oh!

gcanlin commented Jan 20, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

gcanlin commented Jan 20, 2026 •

edited by github-actions bot

Loading