You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Is NearestEmbedEMA for EMA codebook update working?
I tried training by replacing NearestEmbed with NearestEmbed, but the loss did not converge. NearestEmbed worked for my case.
The text was updated successfully, but these errors were encountered:
I am also curious about this. Can you try initializing cluster_size to be ones instead of zeros? Although the latter is done in the Sonnet's official implementation, I actually don't understand why it should be initialized to zeros, since in case no feature is an NN of a code, that code will seem to be multiplied by a big constant.
Is NearestEmbedEMA for EMA codebook update working?
I tried training by replacing NearestEmbed with NearestEmbed, but the loss did not converge. NearestEmbed worked for my case.
The text was updated successfully, but these errors were encountered: