Add full serve CLI reference back to docs #20978
Changes from 1 commit
| Original file line number | Diff line number | Diff line change |
|---|---|---|
@@ -17,6 +17,7 @@ cloudpickle | |
| fastapi | ||
| msgspec | ||
| openai | ||
| partial-json-parser | ||
| pillow | ||
| psutil | ||
| pybase64 | ||
| Original file line number | Diff line number | Diff line change |
|---|---|---|
@@ -248,6 +248,34 @@ def make_arg_parser(parser: FlexibleArgumentParser) -> FlexibleArgumentParser: | |
| register all arguments instead of manually enumerating them here. This | ||
| avoids code duplication and keeps the argument definitions in one place. | ||
| """ | ||
| parser.add_argument("model_tag", | ||
| type=str, | ||
| nargs="?", | ||
| help="The model tag to serve " | ||
| "(optional if specified in config)") | ||
| parser.add_argument( | ||
| "--headless", | ||
| action="store_true", | ||
| default=False, | ||
| help="Run in headless mode. See multi-node data parallel " | ||
| "documentation for more details.") | ||
| parser.add_argument( | ||
| "--data-parallel-start-rank", | ||
| "-dpr", | ||
| type=int, | ||
| default=0, | ||
| help="Starting data parallel rank for secondary nodes. " | ||
| "Requires --headless.") | ||
| parser.add_argument("--api-server-count", | ||
| "-asc", | ||
| type=int, | ||
| default=1, | ||
| help="How many API server processes to run.") | ||
Comment on lines +256 to +273
Collaborator
At a high level, don't these belong in FrontendArgs?
Member
Author
I thought about adding them there, but I wasn't sure about the data parallel ones, because they are sort of "pre-frontend".
| parser.add_argument( | ||
| "--config", | ||
| help="Read CLI options from a config file. " | ||
| "Must be a YAML with the following options: " | ||
| "https://docs.vllm.ai/en/latest/configuration/serve_args.html") | ||
| parser = FrontendArgs.add_cli_args(parser) | ||
| parser = AsyncEngineArgs.add_cli_args(parser) | ||
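To see how the arguments added in this hunk behave on the command line, here is a minimal sketch that registers the same options on a plain `argparse.ArgumentParser`. The standard-library parser standing in for `FlexibleArgumentParser`, the `make_demo_parser` name, and the `serve-demo` program name are all assumptions for illustration; the `FrontendArgs` and `AsyncEngineArgs` registration is left out.

```python
import argparse


def make_demo_parser() -> argparse.ArgumentParser:
    # Hypothetical helper: plain argparse stands in for FlexibleArgumentParser.
    parser = argparse.ArgumentParser(prog="serve-demo")
    # Optional positional: nargs="?" lets the model come from a config file.
    parser.add_argument("model_tag",
                        type=str,
                        nargs="?",
                        help="The model tag to serve "
                        "(optional if specified in config)")
    parser.add_argument("--headless",
                        action="store_true",
                        default=False,
                        help="Run in headless mode.")
    parser.add_argument("--data-parallel-start-rank",
                        "-dpr",
                        type=int,
                        default=0,
                        help="Starting data parallel rank for secondary "
                        "nodes. Requires --headless.")
    parser.add_argument("--api-server-count",
                        "-asc",
                        type=int,
                        default=1,
                        help="How many API server processes to run.")
    parser.add_argument("--config",
                        help="Read CLI options from a config file.")
    return parser


# No arguments at all: every option falls back to its default.
defaults = make_demo_parser().parse_args([])

# A secondary node using the short aliases from the diff.
secondary = make_demo_parser().parse_args(["--headless", "-dpr", "2", "-asc", "4"])
```

In this sketch the parser itself does not enforce the "Requires --headless" note in the help text; whether and where the real code checks that is not shown in the diff.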
The heading `### help` is a bit generic and could be more descriptive for users navigating the documentation. A more specific heading would improve clarity.
I would call this section "Options".