[V1][Spec Decode] Share input embedding of target model with EAGLE draft model to free ~1GB for llama 3 model #17326

Merged
WoosukKwon merged 12 commits into vllm-project:main from ekagra-ranjan:er-eagle-reuse-embed
May 14, 2025

Commits

Commits on Apr 28, 2025

Commits on May 10, 2025

Commits on May 14, 2025