
Support min_tokens, min_p parameters for api_server #2681

Merged
merged 4 commits on Nov 1, 2024

Conversation

AllentDan
Collaborator

No description provided.

@AllentDan
Collaborator Author

The weird thing is that min_new_tokens did not work. @irexyc

@irexyc
Collaborator

irexyc commented Oct 29, 2024

from lmdeploy import pipeline, GenerationConfig

pipe = pipeline('/nvme/shared/vicuna-7b-v1.5/')
pipe('hello', gen_config=GenerationConfig())
# Response(text='ಠ\\_ಠ', generate_token_len=7, input_token_len=41, session_id=8, finish_reason='stop', token_ids=[227, 181, 163, 20122, 227, 181, 163], logprobs=None, index=0)

pipe('hello', gen_config=GenerationConfig(min_new_tokens=8))
# Response(text='ಠ\\_ಠ How can I assist you today?', generate_token_len=14, input_token_len=41, session_id=7, finish_reason='stop', token_ids=[227, 181, 163, 20122, 227, 181, 163, 1128, 508, 306, 6985, 366, 9826, 29973], logprobs=None, index=0)

@AllentDan
Collaborator Author

But if I set min_new_tokens=100 for internlm2-chat-1_8b, the result is fewer than 100 tokens.

@irexyc
Collaborator

irexyc commented Oct 30, 2024

It only ignores the eos token while the number of generated tokens is less than min_new_tokens. Its behavior should be consistent with transformers, except that transformers supports multiple eos tokens.

But if a stop-word token is encountered, generation will still stop.
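The stopping rule described above can be sketched as follows. This is a minimal illustration of the described semantics, not lmdeploy's actual implementation; the function name and `EOS_TOKEN` value are hypothetical.

```python
EOS_TOKEN = 2  # assumed eos token id for illustration


def should_stop(token, num_generated, min_new_tokens, stop_tokens=()):
    """Return True if generation should stop after emitting `token`."""
    if token in stop_tokens:
        # Stop words are honored regardless of min_new_tokens.
        return True
    if token == EOS_TOKEN:
        # eos is suppressed until min_new_tokens tokens have been generated.
        return num_generated >= min_new_tokens
    return False
```

For example, an eos token at position 7 of a `min_new_tokens=8` run is ignored, which matches the second pipeline call above producing 14 tokens instead of 7.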

@lvhan028 lvhan028 changed the title Support min_tokens for api_server Support min_tokens, min_p parameters for api_server Oct 30, 2024
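For reference, min_p sampling is commonly defined as keeping only the tokens whose probability is at least min_p times the probability of the most likely token, then renormalizing. A sketch under that common definition (not necessarily lmdeploy's exact implementation):

```python
import math


def min_p_filter(logits, min_p):
    """Zero out tokens whose probability falls below min_p times the
    top token's probability; return renormalized probabilities."""
    # Numerically stable softmax over the raw logits.
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    probs = [e / total for e in exps]
    # Dynamic threshold scales with the confidence of the top token.
    threshold = min_p * max(probs)
    kept = [p if p >= threshold else 0.0 for p in probs]
    s = sum(kept)
    return [p / s for p in kept]
```

Unlike top_p, the cutoff adapts to the model's confidence: a peaked distribution prunes aggressively, a flat one keeps more candidates.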
@lvhan028 lvhan028 requested review from irexyc and lvhan028 October 31, 2024 06:37
@lvhan028
Collaborator

lvhan028 commented Nov 1, 2024

Should we update completions_v1 as well?

@AllentDan
Collaborator Author

It is a legacy API from OpenAI. I would prefer not to update it.

@lvhan028 lvhan028 merged commit 654c457 into InternLM:main Nov 1, 2024
5 checks passed
lvhan028 pushed a commit that referenced this pull request Nov 5, 2024
* Support min_tokens for api_server

* fix

* use min_new_tokens

* add min_p
AllentDan added a commit to AllentDan/lmdeploy that referenced this pull request Nov 13, 2024
* Support min_tokens for api_server

* fix

* use min_new_tokens

* add min_p