Skip to content

Conversation

@stas00
Copy link
Contributor

@stas00 stas00 commented Aug 10, 2021

This PR catches on 2 commits:

git remote add other https://github.com/microsoft/Megatron-DeepSpeed
git fetch other
git cherry-pick 3123a2943173
# resolve conflict
git cherry-pick f0131ce0287f008d85101fd

New checkpoint is deepspeedai/Megatron-DeepSpeed@f0131ce028

@stas00 stas00 merged commit 3c9d748 into bigscience-workshop:main Aug 10, 2021
@stas00 stas00 deleted the sync3 branch August 10, 2021 04:27
stas00 added a commit that referenced this pull request Aug 10, 2021
* Use new zero.Init() API (#10)

* query deepspeed global grad norm (#8)

Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
Co-authored-by: Shaden Smith <Shaden.Smith@microsoft.com>
@stas00
Copy link
Contributor Author

stas00 commented Aug 10, 2021

updated tr1-13B as well:

git checkout tr1-13B
git cherry-pick 3c9d748
git push

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants