-
Notifications
You must be signed in to change notification settings - Fork 255
Set add_special_tokens to false by default in Encode
#1442
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
Ok, meeting conclusion is that we don't need this API currently as our internal default values will do what users want. This way we avoid exposing an option that nobody knows what value to set to. |
add_special_tokens to false by default in Encode
Co-authored-by: Ryan Hill <[email protected]>
RyanUnderhill
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Not sure if you want to say in the PR comments why we default to false, just to have some history for it if we forget why we did this in the future.
|
Sets `add_special_tokens` from `OrtxTokenizeWithOptions` added in microsoft/onnxruntime-extensions#940 to false to solve chat template issue in GenAI with extra BOS tokens. See huggingface/transformers#37686 for more context. --------- Co-authored-by: Sayan Shaw <[email protected]> Co-authored-by: Ryan Hill <[email protected]>
Update version to 0.8.0-rc2 and cherry pick these 3 changes: #1435 update ESRP settings #1434 make WebGPU name consistent #1432 Missed an all lowercase "webgpu" string #1440 Apply provider name backwards compatibility at runtime #1452 Update Extensions Commit to Support Chat Template Override for Unsupported Models #1439 Sign macos binaries #1442 Set `add_special_tokens` --------- Co-authored-by: Guenther Schmuelling <[email protected]> Co-authored-by: Sayan Shaw <[email protected]> Co-authored-by: Baiju Meswani <[email protected]> Co-authored-by: Sayan Shaw <[email protected]> Co-authored-by: kunal-vaishnavi <[email protected]>
Sets
add_special_tokensfromOrtxTokenizeWithOptionsadded in microsoft/onnxruntime-extensions#940 to false to solve chat template issue in GenAI with extra BOS tokens.See huggingface/transformers#37686 for more context.