[Doc]: LLM.wait_for_completion output_type default is inaccurate

### 📚 The doc issue

`LLM.wait_for_completion()` documents `output_type` as defaulting to `RequestOutput`, but the implementation defaults to accepting both `RequestOutput` and `PoolingRequestOutput` when `output_type` is not provided.

This can mislead readers into thinking the method only expects generation outputs by default, while the actual behavior is broader.

### Suggest a potential alternative/fix

Update the `output_type` argument description to state that, when omitted, it accepts both `RequestOutput` and `PoolingRequestOutput`.

### Before submitting a new issue...

- [x] Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the [documentation page](https://docs.vllm.ai/en/latest/), which can answer lots of frequently asked questions.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Doc]: LLM.wait_for_completion output_type default is inaccurate #44616

📚 The doc issue

Suggest a potential alternative/fix

Before submitting a new issue...

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Uh oh!

[Doc]: LLM.wait_for_completion output_type default is inaccurate #44616

Description

📚 The doc issue

Suggest a potential alternative/fix

Before submitting a new issue...

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions