[Frontend] OpenAI API server: Add `add_special_tokens` to ChatCompletionRequest (default False) by tomeras91 · Pull Request #5278 · vllm-project/vllm

tomeras91 · 2024-06-05T12:09:42Z

#4688 introduced a change to how messages are formatted into a prompt for the chat endpoint - the prompt is tokenized with add_special_tokens=False so a BOS token is not added. It is assumed that the chat template takes care of adding all needed special tokens.

This PR aims to make this behavior configurable instead of hardcoded. By adding add_special_tokens as a field to ChatCompletionRequest, the user can control whether a BOS token should be added or not. This is useful because not all chat templates add the BOS token.

…se it when tokenizing prompt

DarkLight1337 · 2024-06-05T14:20:29Z

LGTM, thanks for making it configurable!

…ionRequest (default False) (vllm-project#5278)

tomeras91 added 3 commits June 5, 2024 14:54

Add add_special_tokens to ChatCompletionRequest (default False) and u…

a880a45

…se it when tokenizing prompt

format

074261c

format

6b5611f

DarkLight1337 approved these changes Jun 5, 2024

View reviewed changes

DarkLight1337 mentioned this pull request Jun 5, 2024

[Bug]: Regression in predictions in v0.4.3 #5280

Closed

simon-mo merged commit f0a5005 into vllm-project:main Jun 5, 2024

chengzhi-lu pushed a commit to chengzhi-lu/vllm that referenced this pull request Jun 6, 2024

[Frontend] OpenAI API server: Add add_special_tokens to ChatComplet…

4848175

…ionRequest (default False) (vllm-project#5278)

robertgshaw2-redhat pushed a commit to neuralmagic/nm-vllm that referenced this pull request Jun 11, 2024

[Frontend] OpenAI API server: Add add_special_tokens to ChatComplet…

47c1256

…ionRequest (default False) (vllm-project#5278)

joerunde pushed a commit to joerunde/vllm that referenced this pull request Jun 17, 2024

[Frontend] OpenAI API server: Add add_special_tokens to ChatComplet…

66ace0c

…ionRequest (default False) (vllm-project#5278)

xjpang pushed a commit to xjpang/vllm that referenced this pull request Jun 27, 2024

[Frontend] OpenAI API server: Add add_special_tokens to ChatComplet…

b8fd7c5

…ionRequest (default False) (vllm-project#5278)

xjpang pushed a commit to xjpang/vllm that referenced this pull request Jul 8, 2024

[Frontend] OpenAI API server: Add add_special_tokens to ChatComplet…

f466b8e

…ionRequest (default False) (vllm-project#5278)

DarkLight1337 mentioned this pull request Jul 19, 2024

[Frontend] Refactor prompt processing #4028

Merged

xjpang pushed a commit to xjpang/vllm that referenced this pull request Jul 24, 2024

[Frontend] OpenAI API server: Add add_special_tokens to ChatComplet…

76b50b7

…ionRequest (default False) (vllm-project#5278)

tomeras91 deleted the configurable-bos branch August 12, 2024 15:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Comments

[Frontend] OpenAI API server: Add `add_special_tokens` to ChatCompletionRequest (default False)#5278

[Frontend] OpenAI API server: Add `add_special_tokens` to ChatCompletionRequest (default False)#5278
simon-mo merged 3 commits intovllm-project:mainfrom
tomeras91:configurable-bos

tomeras91 commented Jun 5, 2024

Uh oh!

DarkLight1337 commented Jun 5, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Comments

Conversation

tomeras91 commented Jun 5, 2024

Uh oh!

DarkLight1337 commented Jun 5, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants