Tokenizer config fix for dynamic mode (#144) by pramodkumar-habanalabs · Pull Request #1903 · huggingface/optimum-habana

pramodkumar-habanalabs · 2025-04-01T06:15:41Z

Issue:
For Llama2 7B in dynamic mode with padding="max_length", the tokenizer produces 'input_ids' without padding, so the list contains only the actual tokens. This is expected and works fine.

For Llama3.1 8B in dynamic mode with padding="max_length", the tokenizer pads 'input_ids' to 128K tokens (the maximum model input length). This is not expected.

Fix: The tokenizer configuration has been changed to address this issue.
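
A possible explanation for the difference, sketched in plain Python. This is a simplified model, not the transformers code and not the change made in this PR. It assumes that with padding="max_length" and no explicit max_length, the tokenizer falls back to its configured model_max_length, and skips padding when that value is an effectively unbounded sentinel. The names `NO_LIMIT_THRESHOLD`, `resolve_pad_length` and `pad_input_ids`, the sentinel value used for the Llama2-like case, and the token ids are all hypothetical.

```python
from typing import List, Optional

# Assumption: above this value, model_max_length is treated as "no limit".
NO_LIMIT_THRESHOLD = int(1e20)


def resolve_pad_length(model_max_length: int, max_length: Optional[int] = None) -> Optional[int]:
    """Return the target length for padding="max_length", or None for no padding."""
    if max_length is not None:
        # An explicit max_length always wins.
        return max_length
    if model_max_length > NO_LIMIT_THRESHOLD:
        # No usable limit is configured, so padding is skipped.
        return None
    # Otherwise pad all the way up to the configured model limit.
    return model_max_length


def pad_input_ids(input_ids: List[int], pad_token_id: int, target: Optional[int]) -> List[int]:
    """Pad input_ids up to target; the padding side is arbitrary in this sketch."""
    if target is None or len(input_ids) >= target:
        return list(input_ids)
    return list(input_ids) + [pad_token_id] * (target - len(input_ids))


prompt_ids = [1, 15043, 3186]  # placeholder token ids

# Llama2-like case: model_max_length assumed to be a huge sentinel value.
llama2_like = pad_input_ids(prompt_ids, 0, resolve_pad_length(int(1e30)))

# Llama3.1-like case: model_max_length of 131072 (128K).
llama31_like = pad_input_ids(prompt_ids, 0, resolve_pad_length(131072))

print(len(llama2_like), len(llama31_like))
```

Under these assumptions the first case keeps only the actual tokens while the second grows to the full 128K length, which matches the behavior described in the issue. Passing an explicit length, for example `resolve_pad_length(131072, max_length=512)`, bounds the padding in the second case.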

regisss · 2025-04-16T18:21:10Z

@pramodkumar-habanalabs Can you share a code snippet that enables me to reproduce this issue with Llama 3.1 please?

libinta · 2025-04-23T16:14:36Z

@regisss You can try with the command below:
PT_HPU_LAZY_MODE=0 python3 /root/repos/optimum-habana-fork/examples/text-generation/run_generation.py --model_name_or_path /mnt/weka/data/pytorch/llama3.1/Meta-Llama-3.1-8B-Instruct/ --attn_softmax_bf16 --use_kv_cache --max_new_tokens 128 --bf16 --batch_size 8 --trim_logits --max_input_tokens -1 --warmup 2 --torch_compile --dataset_name tatsu-lab/alpaca --run_partial_dataset --n_iterations 10

regisss

LGTM

[SW-218176]: Tokenizer config fix for dynamic mode (#144)

2f18ef7

pramodkumar-habanalabs requested a review from regisss as a code owner April 1, 2025 06:15

karol-brejna-i added the synapse 1.21 label Apr 7, 2025

libinta added the run-test Run CI for PRs from external contributors label Apr 9, 2025

libinta changed the title ~~[SW-218176]: Tokenizer config fix for dynamic mode (#144)~~ Tokenizer config fix for dynamic mode (#144) Apr 9, 2025

regisss approved these changes Apr 23, 2025

regisss merged commit b6c0691 into huggingface:main Apr 23, 2025
1 of 4 checks passed

regisss merged 1 commit into huggingface:main from HabanaAI:auto-pr-8ed3883

4 participants
