We are using TorchRec with the two types of parallelism in our system. To better understand the details and choose the best communication primitives for our code, I would like to know which specific communication primitives (operators) are used at each step, e.g., for the metadata transfer, the embedding lookup, the pooling results, and the gradient transfer. Could you please point me to both the code and the documentation for these?
Thanks!