Hello, I am trying to use DeepSpeed in the [...]. The trainer is:
and the corresponding import at the beginning of the file:
Any other suggestion is more than welcome.
Thanks,
-
@nickKyr, is this a general problem with DeepSpeed+Lightning or specific to your scenario?
-
Hello, I'm running into a similar issue. I'm trying to run DeepSpeed (0.3.12) + Lightning (1.2.3) in the following toy example:
The following script is used to launch the SLURM job:
Executing this results in the error below:
Attempting to upgrade to PyTorch Lightning 1.4.0 resulted in the same error. Is there something I'm missing?
The LOCAL_RANK (and mixed-precision) issue was resolved by upgrading to Lightning v1.4.
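For anyone debugging a similar LOCAL_RANK problem under SLURM, it may help to print the rank-related environment variables from inside the job before the trainer is created. The snippet below is only a diagnostic sketch: the helper name `report_rank_env` and the choice of variables are assumptions for illustration and are not taken from the scripts in this thread.

```python
import os

# Rank-related variables commonly set by distributed launchers and by SLURM.
# This list is an assumption for illustration; adjust it to your setup.
RANK_VARS = (
    "LOCAL_RANK",
    "RANK",
    "WORLD_SIZE",
    "SLURM_LOCALID",
    "SLURM_PROCID",
    "SLURM_NTASKS",
)


def report_rank_env(environ=None):
    """Return a dict mapping each variable name to its value, or None if unset."""
    env = os.environ if environ is None else environ
    return {name: env.get(name) for name in RANK_VARS}


if __name__ == "__main__":
    # Print one line per variable so the output is easy to find in the job log.
    for name, value in report_rank_env().items():
        print(f"{name}={value if value is not None else '<unset>'}")
```

Running this as the first step of the job script shows which of these variables each process actually sees, which can help narrow down whether the launcher or the library is expected to set LOCAL_RANK.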