Skip to content

apply normal_ after assigning weight as nn.Parameter to avoid unneces…

13faf3b
Select commit
Loading
Failed to load commit list.
Merged

Make loading of pretrained gpt2 faster by avoiding initialization of Conv1D's weights #21879

apply normal_ after assigning weight as nn.Parameter to avoid unneces…
13faf3b
Select commit
Loading
Failed to load commit list.

Workflow runs completed with no jobs