position ids for kv-cache #71
Conversation
How does this differ from @ariG23498's PR (#69)? Did you test it with the benchmark scripts to see if there is a noticeable speedup?
Shouldn't we put it in the same PR then? Or is it also a general fix? I am a bit confused about what exactly it does.
I didn't have permissions, so I made a PR on the PR... sorry about the confusion. It adds the position_ids so that the RoPE position embeddings are applied correctly, and then we sample new tokens sequentially using the kv-cache, up to max_new_tokens, in generate.
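Roughly, the idea is something like the sketch below (not the PR's actual code; names like `rope_angles`, `prompt_len`, and `cache_len` are illustrative): during cached decoding only the new token is fed in, so its position_id has to be the current cache length rather than 0, otherwise the RoPE rotation is computed for the wrong position.

```python
# Minimal, self-contained sketch (assumed names, not the repo's API) showing why
# explicit position_ids matter when decoding with a kv-cache.
import torch

def rope_angles(position_ids, head_dim, base=10000.0):
    # Standard RoPE frequencies; returns cos/sin tables for the given positions.
    inv_freq = 1.0 / (base ** (torch.arange(0, head_dim, 2).float() / head_dim))
    freqs = position_ids.float()[:, None] * inv_freq[None, :]   # (seq, head_dim/2)
    emb = torch.cat((freqs, freqs), dim=-1)                     # (seq, head_dim)
    return emb.cos(), emb.sin()

head_dim = 64

# Prefill: the prompt tokens are rotated with positions 0..prompt_len-1.
prompt_len = 5
cos_prefill, sin_prefill = rope_angles(torch.arange(prompt_len), head_dim)

# Decode step: only the single new token goes through the model, so its
# position_id must be the running sequence length (here 5), not 0.
cache_len = prompt_len
new_position_ids = torch.tensor([cache_len])
cos_new, sin_new = rope_angles(new_position_ids, head_dim)

# Without passing position_ids, the new token would be rotated as if it sat at
# position 0, and its attention scores against the cached keys would be wrong.
print(cos_new.shape)  # torch.Size([1, 64])
```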
@ariG23498 will test it out in the morning, I believe.
Ah okay, I see. I added you to the repo though, you should have permission! |
Loved the implementation.
Commits:
* position ids for rope
* cleanup
* no need for mask
* no mask
* more cleanup
* add back filtering
* more cleanup
* revert the signature of llm's generate and forward
* use self.decoder.lm_use_tokens
* use torch inference_mode
* add back comment
* fix bug
* add back comments
* add back comments
No description provided.