-
Notifications
You must be signed in to change notification settings - Fork 2.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
cudaErrorStreamCaptureUnsupported in cudagraph in multi-threads #2888
Comments
Could you try capturing using |
@nvpohanh yes I has set |
I see... then it seems that cudaDeviceSynchronize() will break cuda stream capture regardless of the modes. Is it possible to replace the cudaDeviceSynchronize() with something else? |
@nvpohanh Now I use |
@nvpohanh sometimes, error still exist, |
Fixed, closed. |
Description
When I use cudagraph in multi-threads(CPU threads), each thread has
preprocess
(default stream, withcudaDeviceSynchronize
), andinfer
(create stream, not default stream), cudagraph only captureenqueueV2
.Now it report error:
cuda error code=900 cudaErrorStreamCaptureUnsupported. cudaDeviceSynchronize()
Question
In the case of 1 GPU and multiple CPU threads, if I call cudaDeviceSynchronize() from one CPU thread, will it wait for other CPU threads to complete their work on the GPU device ?
And A-thread(CPU)
cudaDeviceSynchronize
will break the B-thread(CPU) graph, so it report error code=900 ?Thanks !
The text was updated successfully, but these errors were encountered: