📚 The doc issue
LLM.wait_for_completion() documents output_type as defaulting to RequestOutput, but the implementation defaults to accepting both RequestOutput and PoolingRequestOutput when output_type is not provided.
This can mislead readers into thinking the method only expects generation outputs by default, while the actual behavior is broader.
Suggest a potential alternative/fix
Update the output_type argument description to state that, when omitted, it accepts both RequestOutput and PoolingRequestOutput.
Before submitting a new issue...
📚 The doc issue
LLM.wait_for_completion()documentsoutput_typeas defaulting toRequestOutput, but the implementation defaults to accepting bothRequestOutputandPoolingRequestOutputwhenoutput_typeis not provided.This can mislead readers into thinking the method only expects generation outputs by default, while the actual behavior is broader.
Suggest a potential alternative/fix
Update the
output_typeargument description to state that, when omitted, it accepts bothRequestOutputandPoolingRequestOutput.Before submitting a new issue...