Support computing parameter count in ModelSummary for FSDP models #20151
Labels
callback: model summary
feature (Is an improvement or enhancement)
strategy: fsdp (Fully Sharded Data Parallel)
Description & Motivation
Models that are set up with FSDP (or DTensor) do not show the total parameter count in the ModelSummary.
Pitch
Compute the shapes correctly (similar to the DeepSpeed summary).
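A minimal sketch of the idea, not the actual Lightning implementation: count each parameter by its full (unsharded) size rather than by the shard held on the current rank. It assumes that a DTensor-style parameter reports its global shape through `.shape`, and that a DeepSpeed ZeRO-3 style parameter carries the full element count in a `ds_numel` attribute. `FakeParam`, `logical_numel`, and `total_parameters` are hypothetical names used only for illustration, and `FakeParam` is a pure-Python stand-in rather than a torch type.

```python
import math
from dataclasses import dataclass
from typing import Iterable, Optional, Tuple


@dataclass
class FakeParam:
    """Hypothetical stand-in for a sharded parameter (not a torch or Lightning type)."""

    shape: Tuple[int, ...]          # global (unsharded) shape
    local_shape: Tuple[int, ...]    # shape of the shard held on this rank
    ds_numel: Optional[int] = None  # full element count, DeepSpeed ZeRO-3 style


def logical_numel(param) -> int:
    """Return the element count of the full parameter, not of the local shard."""
    ds_numel = getattr(param, "ds_numel", None)
    if ds_numel is not None:
        # Assumption: when present, this attribute holds the unsharded count.
        return ds_numel
    # Assumption: `.shape` is the global shape, so its product is the full count.
    return math.prod(param.shape)


def total_parameters(params: Iterable) -> int:
    """Sum the full element counts over all parameters."""
    return sum(logical_numel(p) for p in params)


# A 1024x1024 weight and a 1024-element bias, each sharded on dim 0 across 4 ranks.
params = [
    FakeParam(shape=(1024, 1024), local_shape=(256, 1024)),
    FakeParam(shape=(1024,), local_shape=(256,)),
]

# What a summary would report if it only looked at the local shards.
local_total = sum(math.prod(p.local_shape) for p in params)

print("full count: ", total_parameters(params))
print("local count:", local_total)
```

In this example the local count is a quarter of the full count, which is the kind of discrepancy the summary should avoid for sharded models.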
Alternatives
No response
Additional context
No response
cc @Borda @awaelchli @carmocca