max_tokens's default in docstring of InferenceClient::chat_completion's is 20 but that of TGI is 100 #2652

Closed
sadra-barikbin opened this issue Nov 3, 2024 · 1 comment · Fixed by #2653

Comments

@sadra-barikbin

Hi there!🤗

InferenceClient::chat_completion's docstring states that max_tokens's default value is 20, but TGI's server-side default appears to be 100.

max_tokens (`int`, *optional*):
Maximum number of tokens allowed in the response. Defaults to 20.

https://github.com/huggingface/text-generation-inference/blob/6e3220529df5906ae586031873b7865e9923040b/router/src/lib.rs#L939
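To illustrate the mismatch: if the caller omits max_tokens, the request carries no value and the server applies its own default (100 in the linked TGI source), not the 20 the docstring claims. A minimal sketch of that resolution logic — the helper name and structure here are hypothetical, not huggingface_hub or TGI code:

```python
# Hypothetical sketch of how the effective max_tokens is resolved.
# Not actual huggingface_hub or TGI code.

TGI_SERVER_DEFAULT_MAX_TOKENS = 100  # default in TGI's router/src/lib.rs

def resolve_max_tokens(client_value=None):
    """Return the max_tokens the server would actually apply.

    When the client omits max_tokens, the payload carries no value
    and the server falls back to its own default -- 100 in TGI,
    not the 20 stated in the docstring.
    """
    if client_value is not None:
        return client_value
    return TGI_SERVER_DEFAULT_MAX_TOKENS
```

Passing max_tokens explicitly sidesteps the ambiguity entirely, since the server default never comes into play.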

@sadra-barikbin sadra-barikbin changed the title max_new_tokens's default in docstring of InferenceClient::chat_completion's is 20 but that of TGI is 100 max_tokens's default in docstring of InferenceClient::chat_completion's is 20 but that of TGI is 100 Nov 3, 2024
@hanouticelina (Contributor)

Hello @sadra-barikbin, thanks a lot for reporting this! 🤗 I just opened PR #2653 to fix the docstrings.
