Skip to content

[WIP] Support for cached multi-query attention towards speculative decoding #1679

Closed
skrider wants to merge 14 commits intovllm-project:mainfrom
skrider:cached-mqa
Closed

[WIP] Support for cached multi-query attention towards speculative decoding #1679
skrider wants to merge 14 commits intovllm-project:mainfrom
skrider:cached-mqa

Commits

Commits on Nov 30, 2023

Commits on Dec 1, 2023

Commits on Dec 2, 2023

Commits on Dec 23, 2023

Commits on Dec 24, 2023

Commits on Jan 9, 2024