[WIP] Support for cached multi-query attention towards speculative decoding #1679
Closed
skrider wants to merge 14 commits intovllm-project:mainfrom
Closed
[WIP] Support for cached multi-query attention towards speculative decoding #1679skrider wants to merge 14 commits intovllm-project:mainfrom
skrider wants to merge 14 commits intovllm-project:mainfrom
Commits
Commits on Nov 30, 2023
- authored andcommitted

- authored andcommitted

- authored andcommitted

Commits on Dec 1, 2023
- authored andcommitted

- authored andcommitted

- authored andcommitted

- committed
- committed
- committed
Commits on Dec 2, 2023
Commits on Dec 23, 2023
- committed
- committed