Skip to content

[Perf] Remove redundant device copies for CPU-only pooling token IDs, 48.9% E2E throughput improvement#38139

Merged
noooop merged 5 commits intomainfrom
wentao-remove-redundant-prompt-copy
Mar 29, 2026
Merged

[Perf] Remove redundant device copies for CPU-only pooling token IDs, 48.9% E2E throughput improvement#38139
noooop merged 5 commits intomainfrom
wentao-remove-redundant-prompt-copy

Commits

Commits on Mar 25, 2026

Commits on Mar 26, 2026

Commits on Mar 27, 2026

Commits on Mar 28, 2026

Commits on Mar 29, 2026