You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository has been archived by the owner on Nov 17, 2023. It is now read-only.
Currently gluon Trainer iterates over the parameter dict and assign indices for multi-machine training. The index is used to identify the gradient/parameters. This relies on a deterministic order of param dict iteration and deterministic order of parameter creation. However, that is not be true if the user's code defines parameters in a random order (e.g. https://github.com/dmlc/gluon-nlp/blob/v0.9.x/src/gluonnlp/model/attention_cell.py#L223)
The text was updated successfully, but these errors were encountered:
Description
Currently gluon Trainer iterates over the parameter dict and assign indices for multi-machine training. The index is used to identify the gradient/parameters. This relies on a deterministic order of param dict iteration and deterministic order of parameter creation. However, that is not be true if the user's code defines parameters in a random order (e.g. https://github.com/dmlc/gluon-nlp/blob/v0.9.x/src/gluonnlp/model/attention_cell.py#L223)
The text was updated successfully, but these errors were encountered: