Upstream merge conflicts fix #2
Conversation
…2673) Signed-off-by: fan2956 <zhoufan53@huawei.com>
…roject#2647) Signed-off-by: bjf-frz <frz123db@gmail.com>
Signed-off-by: Jinheng Li <ahengljh@gmail.com> Signed-off-by: Canlin Guo <961750412@qq.com> Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com> Co-authored-by: Canlin Guo <961750412@qq.com>
…figs (vllm-project#2622) Signed-off-by: Yiyang Liu <yiyangliu@microsoft.com> Co-authored-by: Yiyang Liu <yiyangliu@microsoft.com> Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
Signed-off-by: Nick Cao <ncao@redhat.com> Co-authored-by: Claude <noreply@anthropic.com>
Signed-off-by: Yueqian Lin <70319226+linyueqian@users.noreply.github.com> Co-authored-by: SYLAR <lishunyang12@users.noreply.github.com>
…g ref_text (vllm-project#2203) Signed-off-by: Yueqian Lin <70319226+linyueqian@users.noreply.github.com> Co-authored-by: Yueqian Lin <70319226+linyueqian@users.noreply.github.com>
…enchmarks (vllm-project#1971) Signed-off-by: samithuang <285365963@qq.com> Signed-off-by: Samit <285365963@qq.com>
Signed-off-by: akshatvishu <akshatnayak197@gmail.com>
…with cuda tests (vllm-project#2340) Signed-off-by: tjtanaa <tunjian.tan@embeddedllm.com>
Signed-off-by: Nick Cao <ncao@redhat.com>
…oject#2676) Signed-off-by: JuanPZuluaga <juanz9312@gmail.com>
…sage (vllm-project#2688) Signed-off-by: tjtanaa <tunjian.tan@embeddedllm.com>
…#2519) Signed-off-by: yuanheng <jonathan.zhaoyh@gmail.com> Signed-off-by: Yuanheng Zhao <jonathan.zhaoyh@gmail.com> Co-authored-by: Hongsheng Liu <liuhongsheng4@huawei.com> Co-authored-by: SYLAR <125541396+lishunyang12@users.noreply.github.com>
…ookup (vllm-project#2407) Signed-off-by: reidliu41 <reid201711@gmail.com> Co-authored-by: Hongsheng Liu <liuhongsheng4@huawei.com>
Signed-off-by: david6666666 <david6666666@users.noreply.github.com> Co-authored-by: david6666666 <david6666666@users.noreply.github.com>
…project#2551) Signed-off-by: gcanlin <canlinguosdu@gmail.com>
…load) (vllm-project#2689) Signed-off-by: Alex Brooks <albrooks@redhat.com> Co-authored-by: SYLAR <125541396+lishunyang12@users.noreply.github.com>
…2706) Signed-off-by: gcanlin <canlinguosdu@gmail.com>
Signed-off-by: tjtanaa <tunjian.tan@embeddedllm.com>
…in OmniEngineArgs (vllm-project#2684) Signed-off-by: Zhengyuan Su <su.zhengyuan@u.nus.edu> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
…Non-deterministic image quality regression. (vllm-project#2458) Signed-off-by: natureofnature <wzliu@connect.hku.hk>
…tor (vllm-project#2520) Signed-off-by: Sy03 <1370724210@qq.com>
…llm-project#2134) Signed-off-by: Celeste-jq <591998922@qq.com> Co-authored-by: Canlin Guo <canlinguosdu@gmail.com>
…lm-project#2720) Signed-off-by: Yueqian Lin <linyueqian@outlook.com>
…t timeout and stage init timeout in order to resolve the CI timeout error. (vllm-project#2711) Signed-off-by: wangyu <410167048@qq.com>
…h AsyncLL… (vllm-project#2716) Signed-off-by: amy-why-3459 <wuhaiyan17@huawei.com>
…llm-project#1555) Signed-off-by: natureofnature <wzliu@connect.hku.hk> Co-authored-by: Hongsheng Liu <liuhongsheng4@huawei.com>
Co-authored-by: wuzhongjian <wuzhongjian@cmss.chinamobile.com>
Signed-off-by: bjf-frz <frz123db@gmail.com>
…ect#2691) Signed-off-by: Alex Brooks <albrooks@redhat.com> Co-authored-by: lengrongfu <lenronfu@gmail.com>
Signed-off-by: Alex Brooks <albrooks@redhat.com> Co-authored-by: Didan Deng <33117903+wtomin@users.noreply.github.com>
…_generates_video[wan22_i2v_usp2_hsdp2] (vllm-project#2883) Signed-off-by: wangyu <410167048@qq.com>
Signed-off-by: Lancer <maruixiang6688@gmail.com>
…t#2343) Signed-off-by: Nick Cao <ncao@redhat.com> Co-authored-by: Claude <noreply@anthropic.com>
…ures (vllm-project#1837) Signed-off-by: CHEN <116010019@link.cuhk.edu.cn> Signed-off-by: Yueqian Lin <70319226+linyueqian@users.noreply.github.com> Signed-off-by: linyueqian <linyueqian@outlook.com> Co-authored-by: Yueqian Lin <70319226+linyueqian@users.noreply.github.com> Co-authored-by: linyueqian <linyueqian@outlook.com>
Signed-off-by: Joshna Medisetty <joshna.medisetty@intel.com> Signed-off-by: Joshna-Medisetty <joshna.medisetty@intel.com> Co-authored-by: Hongsheng Liu <liuhongsheng4@huawei.com>
Signed-off-by: Alex Brooks <albrooks@redhat.com>
Signed-off-by: hsliuustc0106 <liuhongsheng4@huawei.com> Signed-off-by: hsliu <liuhongsheng4@huawei.com> Signed-off-by: Hongsheng Liu <liuhongsheng4@huawei.com> Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Signed-off-by: david6666666 <david6666666@users.noreply.github.com> Co-authored-by: david6666666 <david6666666@users.noreply.github.com>
Signed-off-by: Nick Cao <ncao@redhat.com>
…t#2581) Signed-off-by: CHEN <116010019@link.cuhk.edu.cn>
Signed-off-by: princepride <wangzhipeng628@gmail.com> Signed-off-by: 汪志鹏 <wangzhipeng628@gmail.com> Co-authored-by: Hongsheng Liu <liuhongsheng4@huawei.com>
Signed-off-by: CHEN <116010019@link.cuhk.edu.cn>
Signed-off-by: Lancer <maruixiang6688@gmail.com> Co-authored-by: Samit <285365963@qq.com>
Signed-off-by: gcanlin <canlinguosdu@gmail.com>
…pt (vllm-project#2894) Signed-off-by: Sy03 <1370724210@qq.com>
…2383) Signed-off-by: lishunyang <lishunyang12@163.com> Signed-off-by: reidliu41 <reid201711@gmail.com> Signed-off-by: Alex Brooks <albrooks@redhat.com> Co-authored-by: reidliu41 <reid201711@gmail.com> Co-authored-by: xiaohajiayou <75477391+xiaohajiayou@users.noreply.github.com> Co-authored-by: Alex Brooks <albrooks@redhat.com> Co-authored-by: Hongsheng Liu <liuhongsheng4@huawei.com>
…+decode batches (vllm-project#2903) Signed-off-by: Sy03 <1370724210@qq.com>
…memory (vllm-project#2474) Signed-off-by: willamhou <willamhou@ceresman.com> Co-authored-by: willamhou <willamhou@ceresman.com>
Signed-off-by: xiaohajiayou <923390377@qq.com> Signed-off-by: Samit <285365963@qq.com> Co-authored-by: Samit <285365963@qq.com> Co-authored-by: SYLAR <125541396+lishunyang12@users.noreply.github.com>
…m-project#2018) Signed-off-by: Yuanheng Zhao <jonathan.zhaoyh@gmail.com> Signed-off-by: yuanheng <jonathan.zhaoyh@gmail.com> Co-authored-by: Didan Deng <33117903+wtomin@users.noreply.github.com>
Signed-off-by: lishunyang <lishunyang12@163.com>
…2852) Signed-off-by: fan2956 <zhoufan53@huawei.com>
Signed-off-by: Rein Yang <ruiruyang2@gmail.com>
…m-project#2934) Signed-off-by: amy-why-3459 <wuhaiyan17@huawei.com>
Signed-off-by: Zejian Wang <zejianwang@sjtu.edu.cn>
Code Review
This pull request refactors the deployment and testing infrastructure by introducing a new 'deploy config' schema and migrating several models (Qwen3-Omni, Qwen3-TTS) to it. It adds support for new models like Ming-flash-omni 2.0, VoxCPM, and VoxCPM2, and removes external audio dependencies (librosa, sox) in favor of internal vLLM utilities. The changes also include new Claude skills for contributors, documentation for features like frame interpolation and prefix caching, and various CI pipeline updates. Reviewers identified critical issues where benchmark scripts could silently pass by falling back to empty templates on failure, and noted several instances in new examples where Omni engine instances were not properly closed or librosa was still being imported despite its removal from the project requirements.
if not os.path.exists(result_path):
    with open(OMNI_RESULT_TEMPLATE_PATH, encoding="utf-8") as f:
        template_result: dict[str, Any] = json.load(f)
    Path(result_path).parent.mkdir(parents=True, exist_ok=True)
    with open(result_path, "w", encoding="utf-8") as f:
        json.dump(template_result, f, ensure_ascii=False, indent=2)
    print(f"Benchmark result file not generated, fallback to template: {result_path}")
    result = template_result
else:
    with open(result_path, encoding="utf-8") as f:
        result = json.load(f)
Falling back to a zeroed template when the benchmark result file is missing can hide performance regressions or execution failures in CI. Assertions against zeroed metrics (e.g., latency <= threshold) will pass incorrectly. It is better to raise an explicit error here if the benchmark fails to produce data.
result_path = os.path.join(result_dir, result_filename)
if not os.path.exists(result_path):
    raise FileNotFoundError(f"Benchmark result file not generated: {result_path}")
with open(result_path, encoding="utf-8") as f:
    result = json.load(f)

if not tmp_result_file.exists():
    raise FileNotFoundError(f"Benchmark result file not found: {tmp_result_file}")
with open(DIFFUSION_RESULT_TEMPLATE_PATH, encoding="utf-8") as f:
    template_payload = json.load(f)
# Template schema is fixed and owned by this repo:
# ``diffusion_result_template.json`` is a one-item list and metrics live at [0]["result"].
template_metrics: dict[str, Any] = template_payload[0]["result"]
with open(tmp_result_file, "w", encoding="utf-8") as f:
    json.dump(template_metrics, f, ensure_ascii=False, indent=2)
print(f"Benchmark result file not generated, fallback to template: {tmp_result_file}")

try:
    with open(tmp_result_file, encoding="utf-8") as f:
        metrics: dict[str, Any] = json.load(f)
Similar to run_benchmark.py, falling back to a template when the result file is missing hides failures. If the benchmark script crashes or fails to generate output, the test should fail explicitly.
if not tmp_result_file.exists():
    raise FileNotFoundError(f"Benchmark result file not found: {tmp_result_file}")
with open(DIFFUSION_RESULT_TEMPLATE_PATH, encoding="utf-8") as f:
    template_payload = json.load(f)
# Template schema is fixed and owned by this repo:
# ``diffusion_result_template.json`` is a one-item list and metrics live at [0]["result"].
template_metrics: dict[str, Any] = template_payload[0]["result"]
with open(tmp_result_file, "w", encoding="utf-8") as f:
    json.dump(template_metrics, f, ensure_ascii=False, indent=2)
print(f"Benchmark result file not generated, fallback to template: {tmp_result_file}")
try:
    with open(tmp_result_file, encoding="utf-8") as f:
        metrics: dict[str, Any] = json.load(f)

if not tmp_result_file.exists():
    raise FileNotFoundError(f"Benchmark result file not generated: {tmp_result_file}")
try:
    with open(tmp_result_file, encoding="utf-8") as f:
        metrics: dict[str, Any] = json.load(f)
import time
from typing import NamedTuple

import librosa
Since librosa has been removed from requirements/common.txt in this PR, direct imports and usage of it will fail unless manually installed. Please use vllm.multimodal.media.audio.load_audio instead, which is consistent with other updates in this PR.
import librosa
from vllm.multimodal.media.audio import load_audio
if audio_path:
    if not os.path.exists(audio_path):
        raise FileNotFoundError(f"Audio file not found: {audio_path}")
    audio_signal, sr = librosa.load(audio_path, sr=sampling_rate)

if audio_path:
    if not os.path.exists(audio_path):
        raise FileNotFoundError(f"Audio file not found: {audio_path}")
    sig, sr = librosa.load(audio_path, sr=sampling_rate)
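For the call sites above, a minimal sketch of the replacement the reviewer asks for. The import path comes from the review comment; the assumption that `load_audio` accepts a path plus a target sampling rate and returns `(waveform, sample_rate)`, mirroring `librosa.load(path, sr=...)`, should be verified against the actual helper before copying:

```python
import os

from vllm.multimodal.media.audio import load_audio  # import path taken from the review comment


def read_audio(audio_path: str, sampling_rate: int):
    # Hypothetical helper mirroring the librosa.load call sites above.
    if not os.path.exists(audio_path):
        raise FileNotFoundError(f"Audio file not found: {audio_path}")
    # Assumed to decode and resample to `sampling_rate`, like librosa.load(..., sr=...);
    # check the real signature of load_audio in vllm.multimodal.media.audio.
    audio_signal, sr = load_audio(audio_path, sampling_rate)
    return audio_signal, sr
```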
    default=None,
    help="Path to local audio file. Uses default asset if not provided.",
)
parser.add_argument(
omni = Omni(
    model=args.model,
    stage_configs_path=args.stage_configs_path,
    log_stats=args.log_stats,
    stage_init_timeout=args.stage_init_timeout,
)
The Omni instance created in _run_sync is never closed, which can lead to leaked resources (e.g., orphaned stage worker processes). Please wrap the usage in a with block or call omni.close() in a finally block.
omni = Omni(
    model=args.model,
    stage_configs_path=args.stage_configs_path,
    log_stats=args.log_stats,
    stage_init_timeout=args.stage_init_timeout,
)

with Omni(
    model=args.model,
    stage_configs_path=args.stage_configs_path,
    log_stats=args.log_stats,
    stage_init_timeout=args.stage_init_timeout,
) as omni:
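Where restructuring into a with block is awkward, the explicit-close alternative mentioned in the review could look like the sketch below. Constructor arguments are copied from the snippet above, `omni.close()` is the call named in the comment, and the request loop is only a placeholder:

```python
omni = Omni(
    model=args.model,
    stage_configs_path=args.stage_configs_path,
    log_stats=args.log_stats,
    stage_init_timeout=args.stage_init_timeout,
)
try:
    # ... issue generation requests against `omni` here, as in the example ...
    pass
finally:
    # Tear down stage worker processes even if the block above raises.
    omni.close()
```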
engine = Omni(
    model=args.model,
    stage_configs_path=args.stage_configs_path,
)
The Omni instance (engine) is not closed at the end of main(), which may leak background processes. Please use a context manager.
engine = Omni(
    model=args.model,
    stage_configs_path=args.stage_configs_path,
)

with Omni(
    model=args.model,
    stage_configs_path=args.stage_configs_path,
) as engine:
Conflicts resolved
- `qwen2_5_omni_multiconnector.yaml` — dropped (moved to `vllm_omni/deploy/` by upstream refactor [Config Refactor 2.5/N] Centralize pipeline registry vllm-project/vllm-omni#2915; mori_connector registration preserved in `qwen2_5_omni_mori_intranode.yaml`)
- `async_omni_engine.py` — kept both the mori `receiver_connectors` path and upstream's `pd_config` detection; both are passed to `Orchestrator(...)` (see the sketch below)
- `orchestrator.py` — kept both the mori machinery (param / field / PUT logic) and upstream's PD state init + mm_features filtering
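As a rough illustration of the `async_omni_engine.py` resolution, a minimal sketch of keeping both sides of the merge side by side. The names `receiver_connectors`, `pd_config`, and `Orchestrator` come from the notes above; the keyword arguments, attribute names, and helper shown here are assumptions for illustration, not the actual constructor signature:

```python
from typing import Any, Optional

# Orchestrator is the project-internal class named above; its real import path
# and signature live in orchestrator.py.


def build_orchestrator(engine_config: Any) -> "Orchestrator":
    """Hypothetical helper: forward both merge sides to the orchestrator."""
    # mori path kept from our branch (attribute name assumed)
    receiver_connectors: Optional[list] = getattr(engine_config, "receiver_connectors", None)
    # upstream PD detection kept from the merge (attribute name assumed)
    pd_config: Optional[Any] = getattr(engine_config, "pd_config", None)

    # Keyword names below are illustrative only.
    return Orchestrator(
        stage_configs=engine_config.stage_configs,
        receiver_connectors=receiver_connectors,
        pd_config=pd_config,
    )
```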
Verification

- `pre-commit run --all-files` passes