Drop RoPE when filling KV cache #3346

Closed

GD06 wants to merge 1 commit into main from genai_nope_qkv

+264 −0

Contributor

GD06 commented Nov 9, 2024 •

This PR provides CUDA kernels that fill the KV cache without applying RoPE.
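To illustrate the idea only, here is a minimal pure-Python sketch of what writing keys and values into a cache with and without rotary position embedding (RoPE) could look like. It is not the CUDA kernel from this PR and does not reflect FBGEMM's API: the names `rope_rotate` and `fill_kv_cache`, the `apply_rope` flag, the adjacent-pair rotation layout, and the base `theta=10000.0` are all assumptions made for this example.

```python
import math


def rope_rotate(vec, pos, theta=10000.0):
    """Rotate adjacent dimension pairs of one head vector by position-dependent angles.

    Hypothetical helper: the pairing layout and base `theta` are assumptions.
    """
    out = list(vec)
    d = len(vec)
    for i in range(0, d, 2):
        angle = pos * theta ** (-i / d)
        c, s = math.cos(angle), math.sin(angle)
        out[i] = vec[i] * c - vec[i + 1] * s
        out[i + 1] = vec[i] * s + vec[i + 1] * c
    return out


def fill_kv_cache(cache_k, cache_v, new_k, new_v, positions, apply_rope=False):
    """Write each new (K, V) row into the cache slot given by its position.

    With apply_rope=False (the case this sketch focuses on), K is stored
    unrotated. V is never rotated in either mode.
    """
    for k, v, pos in zip(new_k, new_v, positions):
        cache_k[pos] = rope_rotate(k, pos) if apply_rope else list(k)
        cache_v[pos] = list(v)


# Toy single-head cache: 4 slots, head dimension 4.
max_seq_len, head_dim = 4, 4
cache_k = [[0.0] * head_dim for _ in range(max_seq_len)]
cache_v = [[0.0] * head_dim for _ in range(max_seq_len)]

new_k = [[1.0, 0.0, 1.0, 0.0], [0.0, 1.0, 0.0, 1.0]]
new_v = [[0.5] * head_dim, [0.25] * head_dim]

# Fill slots 1 and 2 without applying RoPE to the keys.
fill_kv_cache(cache_k, cache_v, new_k, new_v, positions=[1, 2])
```

In this sketch the only difference between the two modes is whether K is rotated before being stored, so the no-RoPE path reduces to a positional copy of K and V into the cache.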

facebook-github-bot added the cla signed label

netlify bot commented Nov 9, 2024 •

✅ Deploy Preview for pytorch-fbgemm-docs ready!

Name	Link
🔨 Latest commit	`b5f25d5`
🔍 Latest deploy log	https://app.netlify.com/sites/pytorch-fbgemm-docs/deploys/673f8241933bb60008443b5f
😎 Deploy Preview	https://deploy-preview-3346--pytorch-fbgemm-docs.netlify.app
GD06 force-pushed the genai_nope_qkv branch from f8c8937 to 77e3b5c Compare

November 21, 2024 07:32

GD06 requested a review from jianyuh

November 21, 2024 07:32

jianyuh approved these changes

Contributor

facebook-github-bot commented Nov 21, 2024

@GD06 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

facebook-github-bot pushed a commit that referenced this pull request


          Drop RoPE when filling KV cache (#3346)

3aab114

Summary:
This PR provides CUDA kernels to fill KV cache without applying ROPE.


Reviewed By: jianyuh

Differential Revision: D66307820

Pulled By: GD06

facebook-github-bot force-pushed the genai_nope_qkv branch from 77e3b5c to 3aab114 Compare

November 21, 2024 18:18

Contributor

facebook-github-bot commented Nov 21, 2024

This pull request was exported from Phabricator. Differential Revision: D66307820

facebook-github-bot added the fb-exported label

facebook-github-bot force-pushed the genai_nope_qkv branch from 3aab114 to ae41dfa Compare

November 21, 2024 18:31

facebook-github-bot force-pushed the genai_nope_qkv branch from ae41dfa to 25e2d83 Compare

November 21, 2024 18:44

facebook-github-bot pushed a commit that referenced this pull request


          Drop RoPE when filling KV cache (#3346)

5cbab5a

Summary:
X-link: facebookresearch/FBGEMM#488

This PR provides CUDA kernels to fill KV cache without applying ROPE.


Reviewed By: jianyuh

Differential Revision: D66307820

Pulled By: GD06

facebook-github-bot force-pushed the genai_nope_qkv branch from 25e2d83 to 5cbab5a Compare

November 21, 2024 18:53


facebook-github-bot force-pushed the genai_nope_qkv branch from 5cbab5a to b5f25d5 Compare

November 21, 2024 18:55

facebook-github-bot closed this in

c7a84ca

Contributor

facebook-github-bot commented Nov 21, 2024

@GD06 merged this pull request in c7a84ca.

facebook-github-bot added the Merged label

Labels

cla signed fb-exported Merged