Skip to content

[Perf] Batch Weight Prefetching via cuMemcpyBatchAsync to Reduce Latency#41474

Open
xiaobao520123 wants to merge 1 commit into
vllm-project:mainfrom
xiaobao520123:feature/batch_memcpy_prefetch
Open

[Perf] Batch Weight Prefetching via cuMemcpyBatchAsync to Reduce Latency#41474
xiaobao520123 wants to merge 1 commit into
vllm-project:mainfrom
xiaobao520123:feature/batch_memcpy_prefetch

Commits

Commits on May 11, 2026