Replies: 1 comment 1 reply
-
it seems it run on every process (gpu). By the way, DeepSpeed inherently needs to sync across GPUs, if the inputs are different across GPU, it seems hang forever with 100% GPU utilization (which is kind of confusing). |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
In the inference tutorial: https://www.deepspeed.ai/tutorials/inference-tutorial/ , for this example:
I want to know:
Beta Was this translation helpful? Give feedback.
All reactions