Commit 03629e1

LucasWilkinson authored and choprahetarth committed
[BugFix] Fix DBO hang (vllm-project#25625)
Signed-off-by: Lucas Wilkinson <[email protected]>
1 parent 2af1417 commit 03629e1

1 file changed: +2 -1 lines changed


vllm/v1/worker/gpu_ubatch_wrapper.py

Lines changed: 2 additions & 1 deletion
@@ -366,7 +366,8 @@ def __call__(self, *args, **kwargs):
                 cudagraph_runtime_mode=CUDAGraphMode.NONE)
             with self.sm_control:
                 return self._capture_ubatches(ubatch_metadata, self.model)
-        elif num_tokens in self.cudagraphs:
+        elif num_tokens in self.cudagraphs \
+            and cudagraph_runtime_mode is CUDAGraphMode.FULL:
             cudagraph_metadata = self.cudagraphs[num_tokens]
             cudagraph_metadata.cudagraph.replay()
             return cudagraph_metadata.outputs
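
The change narrows the replay branch: a graph already captured for `num_tokens` is replayed only when the requested runtime mode is `CUDAGraphMode.FULL`. The following is a minimal sketch of that dispatch rule, not vLLM's actual implementation. `Mode`, `ToyWrapper`, `captured`, and the string return values are hypothetical stand-ins introduced for illustration, and the capture condition is simplified.

```python
from enum import Enum


class Mode(Enum):
    """Hypothetical stand-in for vLLM's CUDAGraphMode."""
    NONE = 0
    PIECEWISE = 1
    FULL = 2


class ToyWrapper:
    """Toy dispatcher; `captured` plays the role of `self.cudagraphs`."""

    def __init__(self):
        self.captured = {}

    def call(self, num_tokens, mode):
        # Capture on the first FULL-mode call for this batch size
        # (simplified assumption about when capture happens).
        if num_tokens not in self.captured and mode is Mode.FULL:
            self.captured[num_tokens] = f"graph[{num_tokens}]"
            return "capture"
        # Guard mirrored from the diff: replay only when a graph exists
        # AND the caller asked for FULL mode.
        elif num_tokens in self.captured and mode is Mode.FULL:
            return "replay"
        # Any other mode falls through to the non-graph path.
        return "eager"
```

With the guard in place, a call with `Mode.NONE` for a batch size that already has a captured graph takes the fall-through path instead of replaying. Without the mode check, the same call would have replayed the graph regardless of the requested mode.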
