Skip to content

[Docs] Guide for using sleep mode and enable sleep mode#660

Merged
hsliuustc0106 merged 12 commits into
vllm-project:mainfrom
knlnguyen1802:sleep_wakeup_docs
Jan 6, 2026
Merged

[Docs] Guide for using sleep mode and enable sleep mode#660
hsliuustc0106 merged 12 commits into
vllm-project:mainfrom
knlnguyen1802:sleep_wakeup_docs

Conversation

@knlnguyen1802
Copy link
Copy Markdown
Contributor

@knlnguyen1802 knlnguyen1802 commented Jan 6, 2026

Resolve comment in #376 (comment)

Purpose

Test Plan

Test Result

cc: @ZJY0516


Essential Elements of an Effective PR Description Checklist
  • The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
  • The test plan, such as providing test command.
  • The test results, such as pasting the results comparison before and after, or e2e results
  • (Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
  • (Optional) Release notes update. If your change is user facing, please update the release notes draft.

BEFORE SUBMITTING, PLEASE READ https://github.com/vllm-project/vllm-omni/blob/main/CONTRIBUTING.md (anything written below this line will be removed by GitHub Actions)

Signed-off-by: knlnguyen1802 <knlnguyen1802@gmail.com>
Signed-off-by: knlnguyen1802 <knlnguyen1802@gmail.com>
Comment thread docs/features/sleep_mode.md
Comment thread docs/features/sleep_mode.md Outdated
Signed-off-by: knlnguyen1802 <knlnguyen1802@gmail.com>
Signed-off-by: knlnguyen1802 <knlnguyen1802@gmail.com>
Signed-off-by: knlnguyen1802 <knlnguyen1802@gmail.com>
Signed-off-by: knlnguyen1802 <knlnguyen1802@gmail.com>
Comment thread docs/features/sleep_mode.md Outdated
Comment thread docs/features/sleep_mode.md Outdated
We added Sleep Mode support for **diffusion models**, which previously lacked this functionality.
In diffusion pipelines, this currently only offloads **model weight memory**, as these models typically do not use KV caches.

This means:
Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

These are not rendered correctly
https://vllm--660.org.readthedocs.build/projects/vllm-omni/en/660/features/sleep_mode/

This means: - Diffusion models can now enter Level 1 sleep. - Pipeline states (e.g., noise schedulers, buffers) remain intact after waking. - Useful for releasing VRAM between image generation or training cycles.

Copy link
Copy Markdown
Contributor Author

@knlnguyen1802 knlnguyen1802 Jan 6, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Comment thread docs/features/sleep_mode.md Outdated
knlnguyen1802 and others added 5 commits January 6, 2026 14:54
Co-authored-by: Jiangyun Zhu <riverclouds.zhu@qq.com>
Signed-off-by: knlnguyen1802 <knlnguyen1802@gmail.com>
Co-authored-by: Jiangyun Zhu <riverclouds.zhu@qq.com>
Signed-off-by: knlnguyen1802 <knlnguyen1802@gmail.com>
Co-authored-by: Jiangyun Zhu <riverclouds.zhu@qq.com>
Signed-off-by: knlnguyen1802 <knlnguyen1802@gmail.com>
Signed-off-by: knlnguyen1802 <knlnguyen1802@gmail.com>
Signed-off-by: knlnguyen1802 <knlnguyen1802@gmail.com>
Comment thread docs/features/sleep_mode.md
@ZJY0516 ZJY0516 requested a review from hsliuustc0106 January 6, 2026 07:15
@ZJY0516 ZJY0516 added the ready label to trigger buildkite CI label Jan 6, 2026
Signed-off-by: knlnguyen1802 <knlnguyen1802@gmail.com>
@hsliuustc0106 hsliuustc0106 merged commit c32235c into vllm-project:main Jan 6, 2026
7 checks passed
Shirley125 pushed a commit to Shirley125/vllm-omni that referenced this pull request Jan 9, 2026
…#660)

Signed-off-by: knlnguyen1802 <knlnguyen1802@gmail.com>
Co-authored-by: Jiangyun Zhu <riverclouds.zhu@qq.com>
princepride pushed a commit to princepride/vllm-omni that referenced this pull request Jan 10, 2026
…#660)

Signed-off-by: knlnguyen1802 <knlnguyen1802@gmail.com>
Co-authored-by: Jiangyun Zhu <riverclouds.zhu@qq.com>
sniper35 pushed a commit to sniper35/vllm-omni that referenced this pull request Jan 10, 2026
…#660)

Signed-off-by: knlnguyen1802 <knlnguyen1802@gmail.com>
Co-authored-by: Jiangyun Zhu <riverclouds.zhu@qq.com>
ZJY0516 added a commit to LawJarp-A/vllm-omni that referenced this pull request Jan 10, 2026
…#660)

Signed-off-by: knlnguyen1802 <knlnguyen1802@gmail.com>
Co-authored-by: Jiangyun Zhu <riverclouds.zhu@qq.com>
@knlnguyen1802 knlnguyen1802 deleted the sleep_wakeup_docs branch May 6, 2026 07:38
daixinning pushed a commit to daixinning/vllm-omni that referenced this pull request May 28, 2026
…#660)

Signed-off-by: knlnguyen1802 <knlnguyen1802@gmail.com>
Co-authored-by: Jiangyun Zhu <riverclouds.zhu@qq.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ready label to trigger buildkite CI

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants