Skip to content

Refactor STT runtime around model-owned adapters#173

Merged
ericcurtin merged 7 commits intovllm-project:mainfrom
LxYuan0420:refactor/stt-runtime-executor
Mar 19, 2026
Merged

Refactor STT runtime around model-owned adapters#173
ericcurtin merged 7 commits intovllm-project:mainfrom
LxYuan0420:refactor/stt-runtime-executor

Conversation

@LxYuan0420
Copy link
Copy Markdown
Collaborator

This PR is:

  • To extract STT execution from model_runner into a dedicated runtime module.
  • To move Whisper- and Qwen3-ASR-specific runtime behavior into model-owned adapters.
  • To remove the legacy top-level stt.transcribe module and keep the public STT API honest.

Next:

  • Quarantine hf_config and serve compatibility glue into a clearer compat-focused module.
  • Tighten the STT runtime adapter contract with stronger typing once the shape is stable.

@LxYuan0420 LxYuan0420 self-assigned this Mar 18, 2026
Move `STTExecutor` out of `vllm_metal/v1/model_runner.py` into
`vllm_metal/stt/runtime.py` so the runner can delegate STT-specific
feature extraction and decode behavior without growing more STT logic.

No behavior change intended.

Signed-off-by: Yuan Lik Xun <lxyuan0420@gmail.com>
Signed-off-by: Yuan Lik Xun <lxyuan0420@gmail.com>
Signed-off-by: Yuan Lik Xun <lxyuan0420@gmail.com>
Signed-off-by: Yuan Lik Xun <lxyuan0420@gmail.com>
Signed-off-by: Yuan Lik Xun <lxyuan0420@gmail.com>
@LxYuan0420 LxYuan0420 force-pushed the refactor/stt-runtime-executor branch from be33758 to b3f5843 Compare March 18, 2026 09:22
Signed-off-by: Yuan Lik Xun <lxyuan0420@gmail.com>
Signed-off-by: Yuan Lik Xun <lxyuan0420@gmail.com>
@ericcurtin ericcurtin merged commit aa60c55 into vllm-project:main Mar 19, 2026
5 checks passed
WindChimeRan added a commit to WindChimeRan/vllm-metal that referenced this pull request Mar 20, 2026
Resolve conflict in model_runner.py: keep STTExecutor class (from vllm-project#173),
drop stale MAX_PACKED_PREFILL_TOKENS constant (removed in this branch).

Signed-off-by: ran <hzz5361@psu.edu>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants