
Conversation

@schoennenbeck
Contributor

For a lot of tokenizers, Tokenizer.apply_chat_template with continue_final_message=True raises a "ValueError: substring not found" if the final message starts or ends with whitespace. Here is some example code that exhibits the issue:

from transformers import AutoTokenizer
tokenizer = AutoTokenizer.from_pretrained("hugging-quants/Meta-Llama-3.1-8B-Instruct-AWQ-INT4")
messages = [
    {"role": "user", "content": "What is the capital of France?"},
    {"role": "assistant", "content": "Great question! The capital of France is "},
]
tokenizer.apply_chat_template(
    messages,
    add_generation_prompt=False,
    continue_final_message=True,
)

This is because the apply_chat_template method looks for the full final message in the rendered chat, but many modern chat templates (in particular the Llama 3.1 chat template) actually trim messages before rendering.
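
Roughly what goes wrong, as a minimal standalone sketch (the template snippet and special tokens are only illustrative, not the library code):

final_message = "Great question! The capital of France is "   # note the trailing space
rendered_chat = (
    "<|start_header_id|>assistant<|end_header_id|>\n\n"       # illustrative Llama 3.1 framing
    + final_message.strip()                                    # template renders roughly "{{ content | trim }}"
    + "<|eot_id|>"
)

try:
    rendered_chat.rindex(final_message)                        # the failing lookup
except ValueError as err:
    print(err)                                                 # -> substring not found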

This PR strips the final message before looking it up in the rendered string, which fixes the issue. However, it means the continuation can now happen at a slightly different spot than the user intended. I believe this is the best way to address the issue, but I would also be open to simply raising a more descriptive error in this case, so the user can strip the last message themselves.
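
In other words, the change amounts to something like the following sketch (the function name is made up for illustration; this is not the exact diff):

def cut_to_continuation(rendered_chat: str, final_message: str) -> str:
    # Sketch of the idea in this PR: strip the final message so it matches what
    # whitespace-trimming templates actually render, then cut the rendered chat
    # right after it so generation continues from that point.
    final_message = final_message.strip()
    end = rendered_chat.rindex(final_message) + len(final_message)
    return rendered_chat[:end]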

Either way I think the current failure mode is not ideal.

If required I could also add a test for this behaviour.
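For reference, such a test might look roughly like this (a sketch only; the checkpoint and assertion mirror the reproduction above):

from transformers import AutoTokenizer

def test_continue_final_message_with_trailing_whitespace():
    # Hypothetical test sketch: the call should no longer raise once the final
    # message is stripped before the lookup.
    tokenizer = AutoTokenizer.from_pretrained(
        "hugging-quants/Meta-Llama-3.1-8B-Instruct-AWQ-INT4"
    )
    messages = [
        {"role": "user", "content": "What is the capital of France?"},
        {"role": "assistant", "content": "Great question! The capital of France is "},
    ]
    rendered = tokenizer.apply_chat_template(
        messages,
        add_generation_prompt=False,
        continue_final_message=True,
        tokenize=False,
    )
    # The prompt should end with the (trimmed) final message so generation continues it.
    assert rendered.endswith("Great question! The capital of France is")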

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you read the contributor guideline, Pull Request section?
  • Was this discussed/approved via a Github issue or the forum? Please add a link to it if that's the case.
  • Did you make sure to update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings.
  • Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

Potential reviewers (based on affected area)

@schoennenbeck
Contributor Author

An alternative, possibly more robust, approach would be to only do the stripping if the full final message string cannot be found in the rendered chat as is.
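
Something along these lines, reusing the sketch from the PR description (again just an illustration, not actual library code):

def cut_to_continuation(rendered_chat: str, final_message: str) -> str:
    # Fallback variant: keep the exact match when the raw content is present, and
    # only strip when it cannot be found (e.g. because the template trimmed it).
    try:
        end = rendered_chat.rindex(final_message) + len(final_message)
    except ValueError:
        final_message = final_message.strip()
        end = rendered_chat.rindex(final_message) + len(final_message)
    return rendered_chat[:end]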

Member

@Rocketknight1 left a comment


@schoennenbeck this is a really good fix, thank you! I think we don't need the more robust solution - because of how whitespace is tokenized, I think it will be quite hard to end your message in multiple spaces and still get a good continuation.

(We're having some CI issues, but hopefully we'll resolve them later today and then I can merge this)

@Rocketknight1 merged commit f2846ad into huggingface:main Oct 17, 2024
@Rocketknight1
Member

@schoennenbeck merged! Thanks again for a clean and helpful PR!

BernardZach pushed a commit to BernardZach/transformers that referenced this pull request Dec 5, 2024
huggingface#34214)

* Strip final message

* Do full strip instead of rstrip

* Retrigger CI

---------

Co-authored-by: Matt <[email protected]>