Skip to content

[Profiler] Add Nsight Systems support for diffusion workers#2382

Closed
lishunyang12 wants to merge 1 commit into
vllm-project:mainfrom
lishunyang12:nsight-systems-profiling
Closed

[Profiler] Add Nsight Systems support for diffusion workers#2382
lishunyang12 wants to merge 1 commit into
vllm-project:mainfrom
lishunyang12:nsight-systems-profiling

Conversation

@lishunyang12
Copy link
Copy Markdown
Collaborator

@lishunyang12 lishunyang12 commented Mar 31, 2026

Supersedes #1098 which went stale due to extensive rebase requirements.

@lishunyang12 lishunyang12 force-pushed the nsight-systems-profiling branch from a6c17e5 to 0ebe551 Compare March 31, 2026 14:29
Add CudaProfilerWrapper support to DiffusionWorker so that diffusion
stages can be profiled with NVIDIA Nsight Systems (nsys). LLM stages
already inherit this from vLLM's GPUWorker, but diffusion workers only
handled profiler: "torch" until now.

Signed-off-by: lishunyang <lishunyang12@163.com>
@lishunyang12 lishunyang12 force-pushed the nsight-systems-profiling branch from 0ebe551 to eaa103c Compare March 31, 2026 14:34
@lishunyang12 lishunyang12 marked this pull request as draft March 31, 2026 14:41
@lishunyang12
Copy link
Copy Markdown
Collaborator Author

Closed for now. Will resume in the near future.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant