Skip to content

Conversation

@GuanhuaWang
Copy link
Contributor

@GuanhuaWang GuanhuaWang commented Nov 6, 2023

This PR is the blog for ZeRO-Offload++, it describes the details of how our new Twin-Flow feature works and its performance numbers on both DGX-A100 and DGX-H100 machines.

Corresponding code PR is #4636

cc @jeffra @awan-10 @tjruwase @mrwyattii

jeffra and others added 2 commits November 6, 2023 11:56
Co-authored-by: GuanhuaWang <alexwgh333@gmail.com>
@jeffra jeffra self-requested a review November 6, 2023 20:29
Copy link
Collaborator

@jeffra jeffra left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

🎉

@jeffra jeffra merged commit 7e480ea into master Nov 6, 2023
@jeffra jeffra deleted the offloadpp-blog branch November 6, 2023 22:17
mauryaavinash95 pushed a commit to mauryaavinash95/DeepSpeed that referenced this pull request Feb 17, 2024
This PR is the blog for ZeRO-Offload++, it describes the details of how
our new Twin-Flow feature works and its performance numbers on both
DGX-A100 and DGX-H100 machines.

Corresponding code PR is
deepspeedai#4636

cc @jeffra @awan-10 @tjruwase @mrwyattii

---------

Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants