Minor patch release

@Borda Borda released this 12 Apr 15:31
· 39 commits to release/LTS since this release
a020506

App

Changed

  • Added healthz endpoint to plugin server (#16882)
  • System customization is now synced for job runs (#16932)

Fabric

Changed

  • Let TorchCollective work on the torch.distributed WORLD process group by default (#16995)

Fixed

  • Fixed for all _cuda_clearCublasWorkspaces on teardown (#16907)
  • Improved the error message for installing tensorboard or tensorboardx (#17053)

PyTorch

Changed

  • Changes to the NeptuneLogger (#16761):
    • It now supports neptune-client 0.16.16 and neptune >=1.0, and we have replaced the log() method with append() and extend().
    • It now accepts a namespace Handler as an alternative to Run for the run argument. This means that you can call it like NeptuneLogger(run=run["some/namespace"]) to log everything to the some/namespace/ location of the run.
  • Allow sys.argv and args in LightningCLI (#16808)
  • Moved HPU broadcast override to the HPU strategy file (#17011)
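
The NeptuneLogger and LightningCLI changes above could be used roughly as in the sketch below. This is an illustration, not code from the release: the helper names `make_namespaced_logger` and `build_cli`, the `EXAMPLE_CLI_ARGS` value, and the use of `BoringModel` as a stand-in model are all assumptions. It also assumes `lightning`, `neptune`, and `jsonargparse` are installed when the helpers are actually called.

```python
"""Hedged sketch of two PyTorch changes in this release; helper names are hypothetical."""

# Hypothetical CLI arguments, used only for this illustration.
EXAMPLE_CLI_ARGS = ["--trainer.max_epochs=1"]


def make_namespaced_logger(run, namespace="some/namespace"):
    """Return a NeptuneLogger that logs under `namespace` of an existing Neptune run."""
    # Imported lazily so this module loads even when lightning is not installed.
    from lightning.pytorch.loggers import NeptuneLogger

    # Indexing a Run with a path yields a namespace Handler, which the logger
    # accepts in place of the Run itself (#16761).
    return NeptuneLogger(run=run[namespace])


def build_cli(args=None):
    """Build a LightningCLI from an explicit argument list, without running a subcommand."""
    from lightning.pytorch.cli import LightningCLI
    from lightning.pytorch.demos.boring_classes import BoringModel

    # `args` is passed programmatically; `run=False` only instantiates the
    # model and trainer instead of calling fit/validate/test.
    return LightningCLI(BoringModel, args=list(args or EXAMPLE_CLI_ARGS), run=False)
```

With neptune >=1.0, the `run` object would typically come from `neptune.init_run()`. The value returned by `build_cli()` exposes the instantiated objects as `cli.model` and `cli.trainer`.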

Deprecated

  • Removed registration of ShardedTensor state dict hooks in LightningModule.__init__ with torch>=2.1 (#16892)
  • Removed the lightning.pytorch.core.saving.ModelIO class interface (#16974)

Fixed

  • Fixed num_nodes not being set for DDPFullyShardedNativeStrategy (#17160)
  • Fixed parsing the precision config for inference in DeepSpeedStrategy (#16973)
  • Fixed the availability check for rich that prevented Lightning from being imported in Google Colab (#17156)
  • Fixed for all _cuda_clearCublasWorkspaces on teardown (#16907)
  • The psutil package is now required for CPU monitoring (#17010)
  • Improved the error message for installing tensorboard or tensorboardx (#17053)

Contributors

@awaelchli, @belerico, @carmocca, @colehawkins, @dmitsf, @Erotemic, @ethanwharris, @kshitij12345, @Borda

If we forgot someone due to not matching commit email with GitHub account, let us know :]