[Speculative Decoding] Support draft model on different tensor-parallel size than target model #5414

Merged
comaniac merged 131 commits into vllm-project:main from wooyeonlee0:spec-tp1-draft on Jun 25, 2024

Commits

Commits on Jun 10, 2024

Commits on Jun 12, 2024

Commits on Jun 13, 2024

Commits on Jun 14, 2024

Commits on Jun 17, 2024

Commits on Jun 18, 2024

Commits on Jun 19, 2024

Commits on Jun 20, 2024

Commits on Jun 21, 2024

Commits on Jun 24, 2024

Commits on Jun 25, 2024