Add served model alias support#3

Closed
krystophny wants to merge 1 commit into main from feature/served-model-name-alias
Conversation

@krystophny
Collaborator

@krystophny krystophny commented Mar 24, 2026

Summary

  • add a dedicated --served-model-name server flag
  • expose a stable alias via the existing served_model_name load path
  • cover the new CLI option with parser tests
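
The alias mechanism described above can be sketched as follows. This is a minimal illustration, not the actual vllm-mlx implementation; the function names `build_parser` and `resolve_served_name` are hypothetical, and only the `--served-model-name` flag itself comes from this PR:

```python
# Sketch: a --served-model-name flag that advertises a stable alias
# (e.g. "qwen") instead of the full model path. Hypothetical helpers;
# not the real vllm-mlx server code.
import argparse


def build_parser() -> argparse.ArgumentParser:
    parser = argparse.ArgumentParser(description="hypothetical server CLI")
    parser.add_argument(
        "--model",
        required=True,
        help="path or repo id of the model to load",
    )
    parser.add_argument(
        "--served-model-name",
        default=None,
        help="alias the server advertises; defaults to --model",
    )
    return parser


def resolve_served_name(args: argparse.Namespace) -> str:
    # Fall back to the full model identifier when no alias is given.
    return args.served_model_name or args.model


if __name__ == "__main__":
    args = build_parser().parse_args(
        ["--model", "Qwen/Qwen2.5-7B-Instruct", "--served-model-name", "qwen"]
    )
    print(resolve_served_name(args))
```

Local clients and benchmarks can then request the model by its short alias regardless of where the weights live on disk.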

Why this is deployable on its own

  • it only changes how the server advertises and validates model names
  • it does not require the Responses branch to remain useful
  • it provides stable short aliases like qwen for local clients and benchmarks

Testing

  • PYTHONPATH=/Users/ert/code/vllm-mlx /Users/ert/code/.venv/bin/python -m pytest tests/test_server.py -q
  • python3 -m compileall vllm_mlx

@krystophny krystophny changed the title Add served model alias flag Add served model alias support Mar 24, 2026
@krystophny
Collaborator Author

This PR is now superseded by upstream waybarrios/vllm-mlx#125, which merged --served-model-name support into main on March 21, 2026.

Our fork's main branch is already synced to upstream main, so keeping this PR open no longer improves the stack. I am closing it to keep the fork's set of PRs clean and non-duplicative.

@krystophny krystophny closed this Mar 24, 2026