[Feat] Support layerwise CPU offloading for more videogen models #2018
Wait for #2809 to be merged first, and then rebase |
Signed-off-by: Yuanheng Zhao <jonathan.zhaoyh@gmail.com>
Signed-off-by: yuanheng <jonathan.zhaoyh@gmail.com>
Done |
|
Layerwise offloading is now supported on LTX-2 and DreamID-Omni. cc @wtomin, @gcanlin, @Bounty-hunter |
|
According to #1832, DreamID-Omni has no L4 e2e test yet. Could you create As for LTX-2, there is an existing PR, #2815. I will remind @Songrui625 to cover the layerwise CPU offloading feature once this PR is merged. |
For now it's still not easy to run I'd like to raise another PR to fix both modeling and dependency for |
I am totally fine with it. |
|
For the updated commit, with layerwise offloading enabled: out_dreamid_omni_oneip-3.mp4 |
|
cc @wtomin , @hsliuustc0106 |
|
LGTM |
…m-project#2018) Signed-off-by: Yuanheng Zhao <jonathan.zhaoyh@gmail.com> Signed-off-by: yuanheng <jonathan.zhaoyh@gmail.com> Co-authored-by: Didan Deng <33117903+wtomin@users.noreply.github.com>
Purpose
Part of #1217
Support and test layerwise CPU offloading on more models
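For context, the idea behind layerwise CPU offloading can be illustrated with a toy, pure-Python sketch. This is not the vllm-omni implementation: `Layer`, `ToyOffloader`, the `prefetch` parameter, and the byte counts are all hypothetical, and "gpu"/"cpu" are just labels. It only shows why peak accelerator residency drops when weights stay in host memory and each block is loaded right before it runs and evicted right after.

```python
from dataclasses import dataclass


@dataclass
class Layer:
    name: str
    weight_bytes: int
    device: str = "cpu"  # all weights start in host memory


@dataclass
class ToyOffloader:
    """Hypothetical helper: keeps at most `prefetch + 1` layers on the accelerator."""

    layers: list
    prefetch: int = 1
    resident_bytes: int = 0
    peak_bytes: int = 0

    def _load(self, layer):
        # Move a layer to the accelerator and update the residency accounting.
        if layer.device == "cpu":
            layer.device = "gpu"
            self.resident_bytes += layer.weight_bytes
            self.peak_bytes = max(self.peak_bytes, self.resident_bytes)

    def _evict(self, layer):
        # Return a layer to host memory once its forward pass is done.
        if layer.device == "gpu":
            layer.device = "cpu"
            self.resident_bytes -= layer.weight_bytes

    def forward(self):
        executed = []
        for i, layer in enumerate(self.layers):
            # Ensure the current layer and the next `prefetch` layers are resident.
            for upcoming in self.layers[i : i + 1 + self.prefetch]:
                self._load(upcoming)
            executed.append(layer.name)  # stand-in for the layer's compute
            self._evict(layer)
        return executed


def peak_without_offload(layers):
    # With no offloading, every layer is resident at the same time.
    return sum(layer.weight_bytes for layer in layers)


layers = [Layer(f"block{i}", weight_bytes=100) for i in range(8)]
offloader = ToyOffloader(layers, prefetch=1)
executed = offloader.forward()
print(offloader.peak_bytes, peak_without_offload(layers))
```

In this toy setup, residency is bounded by `prefetch + 1` blocks rather than by the whole model. The trade-off in a real system is the extra host-to-device transfer time, which is not modeled here.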
Test Plan
e2e generation and output quality comparison
Lightricks/LTX-2
XuGuo699/DreamID-Omni
Test Result
Stats:
*Collected by DiffusionModelRunner._record_peak_memory
Generated Videos: