AWS OFI NCCL v1.7.2

@bwbarrett bwbarrett released this 25 Aug 17:58
· 378 commits to master since this release
v1.7.2-aws
a463b88

This release is intended only for use on AWS P* instances. A general release that supports other Libfabric networks will be made in the near future. This release includes the following changes:

  • Fix compilation against CUDA versions prior to 11.3.
  • Fix allocation of free lists to avoid accidentally registering user data, which can cause corruption on fork() with older Linux kernels.
  • Fix memory leak with registered bounce buffers.
  • Fix improper usage of optlen in call to fi_getopt().
  • Numerous memory cleanup fixes.
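
To illustrate the free-list fix listed above: memory registration generally operates on whole pages, so a buffer that shares a page with unrelated heap data can pull that data into the registration. A common way to avoid this is to allocate registered buffers on a page boundary and pad them to a whole number of pages. The following is a minimal sketch of that idea, not the plugin's actual code; the helper names `round_up_to_page` and `alloc_page_aligned` are hypothetical.

```c
#include <stdint.h>
#include <stdlib.h>
#include <unistd.h>

/* Hypothetical helper: round size up to a whole number of pages.
 * Assumes size is small enough that the addition does not overflow. */
static size_t round_up_to_page(size_t size, size_t page_size)
{
    return (size + page_size - 1) / page_size * page_size;
}

/* Hypothetical helper: allocate a buffer that starts on a page boundary
 * and spans whole pages, so that registering it cannot cover pages that
 * also hold unrelated user data. Returns NULL on failure; on success
 * *alloc_len holds the padded length that should be registered. */
static void *alloc_page_aligned(size_t size, size_t *alloc_len)
{
    long page = sysconf(_SC_PAGESIZE);
    void *buf = NULL;
    size_t len;

    if (page <= 0 || size == 0 || alloc_len == NULL)
        return NULL;

    len = round_up_to_page(size, (size_t)page);
    if (posix_memalign(&buf, (size_t)page, len) != 0)
        return NULL;

    *alloc_len = len;
    return buf;
}
```

A caller would register the returned pointer with the padded length and release the buffer with free() after deregistering it.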

This release has been tested on P3dn, P4d/P4de, and P5 using the EFA provider in Libfabric.