FP16_Optimizer Support for more Deepspeed Versions #12046

Lafi7e · 2022-06-30T07:20:28Z

Add FP16_Optimizer support for more Deepspeed versions. Also add some warn message for user to check if the overrides take effect or not.

pengwa · 2022-06-30T08:05:03Z

orttraining/orttraining/python/training/optim/_ds_modifier.py

 import warnings
 from distutils.version import LooseVersion
+
+import deepspeed


we cannot put "import deepspeed" here, it might break when import the module without deepspeed installed.

right. just fixed it. but I think we don't need to try catch as when it goes there deepspeed must be installed.

* fp16_optimizer for more ds versions * change ds version * bugfix * fix bug

* support optimizer opt for deepspeed 0.5.9 * resolve comments * resolve comments * FP16_Optimizer Support for more Deepspeed Versions (#12046) * fp16_optimizer for more ds versions * change ds version * bugfix * fix bug * Fix unused function warning for decodeMIDR(). (#12069) Changed from static function defined in header to function declared in header and defined in separate .cc file. * pin protobuf version to be compatible with onnx (#12132) Co-authored-by: Ashwini Khade <[email protected]@orttrainingdev10.d32nl1ml4oruzj4qz3bqlggovf.px.internal.cloudapp.net> * RoiAlign CPU EP add warning for max mode with samples != 1 (#12136) * RoiAlign add warning about incorrect max summation when sample size not 1 * include coreml_provider_factory.h in macos build instead of coreml_ex… (#12138) include coreml_provider_factory.h in macos build instead of coreml_execution_provider.h * List 3.10 as supported python version and remove 3.6 (#12141) list 3.10 as supported python version and remove 3.6 Co-authored-by: Randy Shuai <[email protected]> * Use updated symbolic_helper.check_training_mode (#11900) Co-authored-by: Jingyan Wang, Baiju Meswani * Fix GH issue 12151 by using inverse perms for updating DQ axis attribute (#12158) * Fix GH issue 12151. Need to use inverse perms for updating that axis to what is used for transposing the input. This only applies if the DQ node is doing per-axis dequantization. * fixing positions for beam search gpt2 (#12156) * fixing positions for beam search gpt2 Co-authored-by: Tianlei Wu <[email protected]> * remove wrong placed libs (#12201) * Add file mapping for windows platform. (#12183) * Add file mapping for windows platform. * Add unit test for file mapping for windows. Also add an error message for mis-aligned offset * Add unit test for file mapping for windows. Also add an error message for mis-aligned offset * Update data type to avoid warnings * Compitable data type to avoid warnings. Update CreatFileMapping2 condition for winml compiling. * Add type conversion to avoid warnings for X86 release build. Co-authored-by: Ting Cao <[email protected]> * Fix bug where onnxruntime_USE_NCCL flag would default to ON (#12195) Fix bug where onnxruntime_USE_NCCL flag would default to ON, causing ORT to not build properly. New functionality: flag is ON when training is enabled and NCCL is not disabled. Flag is OFF otherwise Co-authored-by: zhijxu <[email protected]> Co-authored-by: zhijxu <zhijxu> Co-authored-by: Vincent Wang <[email protected]> Co-authored-by: Edward Chen <[email protected]> Co-authored-by: Ashwini Khade <[email protected]> Co-authored-by: Ashwini Khade <[email protected]@orttrainingdev10.d32nl1ml4oruzj4qz3bqlggovf.px.internal.cloudapp.net> Co-authored-by: Dwayne Robinson <[email protected]> Co-authored-by: Carson Swope <[email protected]> Co-authored-by: Randy Shuai <[email protected]> Co-authored-by: jingyanwangms <[email protected]> Co-authored-by: Scott McKay <[email protected]> Co-authored-by: Viswanath Boga <[email protected]> Co-authored-by: leqiao-1 <[email protected]> Co-authored-by: caoting-dotcom <[email protected]> Co-authored-by: Ting Cao <[email protected]> Co-authored-by: Sean Murray <[email protected]>

Lafi7e added 3 commits June 30, 2022 14:31

fp16_optimizer for more ds versions

a8d0500

Merge branch 'master' into weicwang/fp16_opt

dfb9bbb

change ds version

ddf3c56

Lafi7e added the component:ortmodule label Jun 30, 2022

Lafi7e requested review from pengwa and zhijxu-MS June 30, 2022 07:20

bugfix

e7e6e6f

pengwa reviewed Jun 30, 2022

View reviewed changes

zhijxu-MS previously approved these changes Jun 30, 2022

View reviewed changes

fix bug

eb8d263

Lafi7e dismissed zhijxu-MS’s stale review via eb8d263 June 30, 2022 08:12

pengwa approved these changes Jun 30, 2022

View reviewed changes

Lafi7e merged commit 04f7c2d into master Jun 30, 2022

Lafi7e deleted the weicwang/fp16_opt branch June 30, 2022 10:36

hanbitmyths added the release:1.12 label Jul 14, 2022

RandySheriffH pushed a commit that referenced this pull request Jul 18, 2022

FP16_Optimizer Support for more Deepspeed Versions (#12046)

12c9628

* fp16_optimizer for more ds versions * change ds version * bugfix * fix bug

RandySheriffH mentioned this pull request Jul 18, 2022

Cherry for release 1.12.0 final #12218

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

FP16_Optimizer Support for more Deepspeed Versions #12046

FP16_Optimizer Support for more Deepspeed Versions #12046

Uh oh!

Lafi7e commented Jun 30, 2022

Uh oh!

pengwa Jun 30, 2022

Uh oh!

Lafi7e Jun 30, 2022 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

FP16_Optimizer Support for more Deepspeed Versions #12046

FP16_Optimizer Support for more Deepspeed Versions #12046

Uh oh!

Conversation

Lafi7e commented Jun 30, 2022

Uh oh!

pengwa Jun 30, 2022

Choose a reason for hiding this comment

Uh oh!

Lafi7e Jun 30, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Lafi7e Jun 30, 2022 •

edited

Loading