Skip to content

[Core][CUDA Graph] add output buffer for cudagraph to reduce memory footprint#5074

Merged
youkaichao merged 10 commits into
vllm-project:mainfrom
youkaichao:cudagraph_save_memory
Jun 9, 2024
Merged

[Core][CUDA Graph] add output buffer for cudagraph to reduce memory footprint#5074
youkaichao merged 10 commits into
vllm-project:mainfrom
youkaichao:cudagraph_save_memory

Commits

Commits on May 27, 2024

Commits on May 30, 2024

Commits on Jun 7, 2024