Skip to content

Use pin_memory in forward_batch.init_new to reduce decoding latency#21360

Open
litmei wants to merge 13 commits intosgl-project:mainfrom
litmei:decode_low_latency
Open

Use pin_memory in forward_batch.init_new to reduce decoding latency#21360
litmei wants to merge 13 commits intosgl-project:mainfrom
litmei:decode_low_latency

Commits

Commits on Mar 25, 2026

Commits on Mar 27, 2026

Commits on Mar 30, 2026

Commits on Apr 1, 2026

Commits on Apr 6, 2026

Commits on Apr 7, 2026

Commits on Apr 8, 2026

Commits on Apr 9, 2026

Commits on Apr 10, 2026