[Feature] Option to save model weights to CPU when memory saver mode is enabled #10873

mattnappo · 2025-09-24T16:00:21Z

Motivation

This PR addresses confusion in enable_memory_saver mode, which is raised in several issues:

Although torch_memory_saver was originally built for RL, offloading weights from GPU to CPU memory is useful for other use cases, such as memory snapshot/restore. This PR adds a flag, enable_weights_cpu_backup, that offloads model weights from GPU to CPU memory so that they can be restored later.
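To make the difference concrete, here is a toy model of pausing and resuming a memory region with and without a CPU backup. It is only a sketch: `ToyRegion` and its methods are hypothetical names invented for illustration, it is not the torch_memory_saver API, and a plain `bytearray` stands in for GPU memory.

```python
class ToyRegion:
    """Toy stand-in for a pausable memory region (hypothetical, not the real API)."""

    def __init__(self, data: bytes, enable_cpu_backup: bool):
        self.device = bytearray(data)  # pretend this buffer lives in GPU memory
        self.enable_cpu_backup = enable_cpu_backup
        self.host_copy = None          # CPU-side backup, filled on pause
        self._size = len(data)

    def pause(self):
        # With backup enabled, copy the contents to host memory before releasing.
        if self.enable_cpu_backup:
            self.host_copy = bytes(self.device)
        self.device = None             # the "GPU" memory is released

    def resume(self):
        # A fresh allocation: zeros here, undefined contents on real hardware.
        self.device = bytearray(self._size)
        if self.host_copy is not None:
            self.device[:] = self.host_copy  # restore the saved contents
            self.host_copy = None


with_backup = ToyRegion(b"weights", enable_cpu_backup=True)
with_backup.pause()
with_backup.resume()

without_backup = ToyRegion(b"weights", enable_cpu_backup=False)
without_backup.pause()
without_backup.resume()
```

In this toy model, `with_backup` holds its original bytes again after `resume()`, while `without_backup` comes back with fresh memory and would need its weights reloaded from elsewhere.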

Modifications

Add flag enable_weights_cpu_backup to ServerArgs to enable offloading model weights from GPU to CPU memory.
Update model runner to set enable_weights_cpu_backup during model loading.
Bump torch-memory-saver version from 0.0.8 to 0.0.9rc1.
Other small unrelated formatting changes (such as removing unused imports).
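A minimal sketch of how such a flag might be wired up is shown below. `SketchServerArgs`, `parse_args`, and `weights_cpu_backup_active` are hypothetical names, and the CLI spellings `--enable-memory-saver` and `--enable-weights-cpu-backup` are assumptions derived from the field names, not copied from the PR diff. The rule that the backup only matters when memory saver mode is on is likewise an assumption based on the PR title.

```python
import argparse
from dataclasses import dataclass


@dataclass
class SketchServerArgs:
    # Hypothetical stand-in for ServerArgs; only the two fields discussed here.
    enable_memory_saver: bool = False
    enable_weights_cpu_backup: bool = False


def parse_args(argv):
    """Parse the two boolean flags into the sketch dataclass."""
    parser = argparse.ArgumentParser()
    # Assumed CLI spellings, derived from the field names.
    parser.add_argument("--enable-memory-saver", action="store_true")
    parser.add_argument("--enable-weights-cpu-backup", action="store_true")
    ns = parser.parse_args(argv)
    return SketchServerArgs(
        enable_memory_saver=ns.enable_memory_saver,
        enable_weights_cpu_backup=ns.enable_weights_cpu_backup,
    )


def weights_cpu_backup_active(args: SketchServerArgs) -> bool:
    # Assumption: the CPU backup only takes effect in memory saver mode.
    return args.enable_memory_saver and args.enable_weights_cpu_backup


args = parse_args(["--enable-memory-saver", "--enable-weights-cpu-backup"])
```

A model runner could then consult something like `weights_cpu_backup_active(args)` when loading weights to decide whether the weight region should keep a CPU copy.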

Benchmark and Profiling

Verified accuracy when using enable_weights_cpu_backup in enable_memory_saver mode.
Verified memory usage when using enable_weights_cpu_backup in enable_memory_saver mode.
Added tests for enable_weights_cpu_backup in enable_memory_saver mode.


JustinTong0323 · 2025-09-30T22:52:47Z

~~please resolve conflicts~~ This PR is quite straightforward; we should check compatibility with the new version of torch_memory_saver (tms).

ishandhanani · 2025-10-05T15:11:13Z

I think this might be causing some CI failures when trying to install for ARM.

https://github.com/sgl-project/sglang/actions/runs/18236550332/job/51931286314

and

https://github.com/sgl-project/sglang/actions/runs/18251221010/job/51965850200

@mattnappo @JustinTong0323


mattnappo force-pushed the main branch from e1bf7d6 to 26e9a4c on September 24, 2025 17:53

mattnappo marked this pull request as ready for review September 25, 2025 16:46

mattnappo requested review from Ying1123, hnyls2002, ispobock, merrymercy and zhyncs as code owners September 25, 2025 16:46

mattnappo force-pushed the main branch from f299706 to a2aedfa on September 25, 2025 16:49

mattnappo requested review from ByronHsu, CatherineSue, HaiShaw, slin1237 and xiezhq-hermann as code owners September 25, 2025 16:49

mattnappo and others added 2 commits September 25, 2025 16:49

Enable cpu backup

2aca390

Bump torch_memory_saver version; only enable CPU backup for model weights; add flag; update test_release_memory_occupation.py

Update test_release_memory_occupation.py

0490ae0

Fix lint

mattnappo force-pushed the main branch from a2aedfa to 0490ae0 on September 25, 2025 16:49

zhyncs assigned lifuhuang and JustinTong0323 Sep 28, 2025

zhyncs added the high priority label Sep 30, 2025

JustinTong0323 added the run-ci label Sep 30, 2025

Merge branch 'main' into main

39b5682

JustinTong0323 approved these changes Sep 30, 2025


Merge branch 'main' into main

a809524

hnyls2002 merged commit 8c57490 into sgl-project:main Oct 3, 2025
63 of 66 checks passed

0xtoward pushed a commit to 0xtoward/sglang that referenced this pull request Oct 5, 2025

[Feature] Option to save model weights to CPU when memory saver mode …

fb25a10

…is enabled (sgl-project#10873) Co-authored-by: molocule <[email protected]>

ch-tiger1 pushed a commit to ch-tiger1/sglang that referenced this pull request Oct 9, 2025

[Feature] Option to save model weights to CPU when memory saver mode …

f632a62

…is enabled (sgl-project#10873) Co-authored-by: molocule <[email protected]>

yyDing1 mentioned this pull request Oct 17, 2025

[trainer, worker] feat: more flexible and easy-to-use reward model volcengine/verl#3679

Merged

lpc0220 pushed a commit to lpc0220/sglang that referenced this pull request Oct 29, 2025

[Feature] Option to save model weights to CPU when memory saver mode …

0590e85

…is enabled (sgl-project#10873) Co-authored-by: molocule <[email protected]>


7 participants
