Skip to content

Pull requests: microsoft/DeepSpeed

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Domino Blog
#6776 opened Nov 22, 2024 by GuanhuaWang Loading…
Fix Doc Error: ZeRO Stage 2 gradient partitioning
#6775 opened Nov 21, 2024 by yewentao256 Loading…
Stage3: Use new torch grad accumulation hooks API
#6773 opened Nov 21, 2024 by deepcharm Loading…
Check transformers version in BLOOM for inference v1
#6766 opened Nov 19, 2024 by lekurile Loading…
BLOOM fixes for DS Legacy Inference
#6765 opened Nov 19, 2024 by lekurile Draft
Flops profiler support einops.einsum
#6755 opened Nov 17, 2024 by lvhoaa Loading…
Fix building on Windows with presence of Triton
#6749 opened Nov 14, 2024 by woct0rdho Loading…
Update flake8 version
#6740 opened Nov 11, 2024 by loadams Loading…
Update formatting workflow
#6738 opened Nov 11, 2024 by loadams Loading…
Merge LoCo with Zero++
#6730 opened Nov 8, 2024 by XingyuXie Loading…
Support latest transformers with DSChat
#6711 opened Nov 4, 2024 by loadams Loading…
Update MII tests to support transformers latest
#6686 opened Oct 29, 2024 by loadams Loading…
Allow to compile collective for PT > 2.3
#6674 opened Oct 27, 2024 by nelyahu Loading…
modify_load_save_model
#6626 opened Oct 15, 2024 by ssklzx Loading…
Improve consistency of zero_grad
#6554 opened Sep 18, 2024 by tohtana Draft
Set shuffle=True by default in data_sampler
#6531 opened Sep 13, 2024 by ranzhejiang Loading…
Adding the new feature of FPDT
#6462 opened Aug 29, 2024 by YJHMITWEB Loading…
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.