Use allreduce_coalesced for factor allreduce #35
Labels: enhancement (New feature or request), help wanted (Extra attention is needed), pytorch-1.11 (Features available in PyTorch 1.11)
pytorch/pytorch#62140
"grouped comm on a set of unflattened tensors can be more performant than flattening+a single flat nccl call."
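For comparison, here is a minimal sketch of the two strategies the quote contrasts. It is not this project's implementation: `allreduce_flat`, `allreduce_grouped`, and `demo` are hypothetical names, the tensors stand in for factor tensors, and the single-process `gloo` group is only an assumption that lets the snippet run on a CPU without a launcher. Newer PyTorch releases may emit a deprecation warning for `all_reduce_coalesced`.

```python
import os
import tempfile

import torch
import torch.distributed as dist


def allreduce_flat(tensors):
    """Flatten into one buffer, run a single all_reduce, copy results back."""
    flat = torch.cat([t.reshape(-1) for t in tensors])
    dist.all_reduce(flat)  # default op is SUM
    offset = 0
    for t in tensors:
        n = t.numel()
        t.copy_(flat[offset:offset + n].view_as(t))
        offset += n


def allreduce_grouped(tensors):
    """Hand the unflattened list to the backend as one grouped call."""
    dist.all_reduce_coalesced(tensors)  # reduces each tensor in place


def demo():
    # Single-process group (assumption for illustration only); with
    # world_size=1 a SUM all-reduce leaves every tensor unchanged.
    init_file = os.path.join(tempfile.mkdtemp(), "init")
    dist.init_process_group(
        "gloo", init_method=f"file://{init_file}", rank=0, world_size=1
    )
    try:
        factors_a = [torch.ones(2, 2), torch.full((3, 3), 2.0)]
        factors_b = [t.clone() for t in factors_a]
        allreduce_flat(factors_a)
        allreduce_grouped(factors_b)
        # Both strategies should produce identical results.
        return all(torch.equal(a, b) for a, b in zip(factors_a, factors_b))
    finally:
        dist.destroy_process_group()
```

The grouped variant avoids the extra concatenate and copy-back passes over the data. Whether it is actually faster depends on the backend and tensor sizes, so it is worth benchmarking before switching.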