extend bucket_internal to SAMPLE generation mode by xt574chen · Pull Request #819 · huggingface/optimum-habana

xt574chen · 2024-03-19T13:22:44Z

What does this PR do?

Extend function #24 to sample mode.

The command to reproduce performance is as follows:
python ../gaudi_spawn.py --use_deepspeed --world_size 4 run_generation.py --model_name_or_path meta-llama/Llama-2-70b-hf --use_hpu_graphs --use_kv_cache --max_input_tokens 128 --max_new_tokens 2048 --batch_size 240 --attn_softmax_bf16 --trim_logits --bf16 --reuse_cache --warmup 1 --n_iterations 1 --limit_hpu_graphs --do_sample --bucket_size 256 --bucket_internal

puneeshkhanna · 2024-03-22T13:10:33Z

Changes look good to me. Same as we have in greedy search and enables bucketing in sampling mode too.

xt574chen · 2024-03-25T02:13:44Z

@regisss Could u help review and merge it?

xt574chen requested review from bhargaveede, ssarkar2 and vivekgoe as code owners March 19, 2024 13:22

[feat] extend bucket_internal to SAMPLE generation mode

5254317

ssarkar2 approved these changes Apr 22, 2024

View reviewed changes

ssarkar2 added the run-test Run CI for PRs from external contributors label Apr 22, 2024

regisss approved these changes Apr 26, 2024

View reviewed changes

regisss merged commit 155fe07 into huggingface:main Apr 26, 2024

ccrhx4 pushed a commit to ccrhx4/ccrhx4.optimum-habana that referenced this pull request May 11, 2024

Extend bucket_internal to SAMPLE generation mode (huggingface#819)

a3b0ea7

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

extend bucket_internal to SAMPLE generation mode#819

extend bucket_internal to SAMPLE generation mode#819
regisss merged 1 commit into
huggingface:mainfrom
xt574chen:feat_extend_bucket_internal

xt574chen commented Mar 19, 2024

Uh oh!

puneeshkhanna commented Mar 22, 2024 •

edited

Loading

Uh oh!

xt574chen commented Mar 25, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

xt574chen commented Mar 19, 2024

What does this PR do?

Uh oh!

puneeshkhanna commented Mar 22, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

xt574chen commented Mar 25, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

puneeshkhanna commented Mar 22, 2024 •

edited

Loading