[low-bit optim] Fix Adam4bit support on PyTorch 2.3 and 2.4. Update AdamFp8 torch requirement #755
Conversation
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/755
Note: Links to docs will display an error until the docs builds have been completed.
❌ 2 New Failures as of commit d83a1c1 with merge base ba2d3b1. The following jobs have failed:
This comment was automatically generated by Dr. CI and updates every 15 minutes.
Converting this to draft since I'm also investigating torch version support for the FP8 optimizer. The FP8 optimizer has never run in CI due to the sm89 constraint.
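As a minimal sketch of the sm89 constraint mentioned above (a hypothetical guard, not the torchao test code): FP8 kernels require a GPU with compute capability at least 8.9, so a check along these lines could gate the FP8 optimizer tests in CI.

```python
import torch

def has_fp8_support() -> bool:
    # Hypothetical helper: FP8 needs compute capability >= (8, 9),
    # e.g. Ada (sm89) or newer; CI runners without such GPUs never exercise it.
    if not torch.cuda.is_available():
        return False
    return torch.cuda.get_device_capability() >= (8, 9)

if has_fp8_support():
    print("FP8 optimizer tests can run on this GPU")
else:
    print("Skipping FP8 optimizer tests (requires sm89+)")
```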
Fixed the issue with 4-bit Adam. 4-bit Adam now works with PyTorch 2.3 as it did in the past. Hopefully CI is green. The issue seems to be related to pytorch/pytorch#128649. I feel somewhat conflicted about this change, since the optimizer state is now flattened instead of having the same shape as the param. I will try a better solution in the future. I think it also has to do with dynamic compile. 4-bit optim is giving us a lot of headaches 🤣.
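To illustrate the trade-off described above, here is a minimal sketch (hypothetical names, not the torchao implementation) of keeping the optimizer state as a flat 1D buffer and only viewing it back to the param shape inside the step, which keeps torch.compile happy at the cost of the state no longer mirroring the param's shape.

```python
import torch

class FlatExpAvg:
    """Hypothetical optimizer-state holder: stores exp_avg as a flat buffer."""

    def __init__(self, param: torch.Tensor):
        # State is flattened instead of matching param.shape.
        self.data = torch.zeros(param.numel(), device=param.device)
        self.shape = param.shape

    def as_param_shape(self) -> torch.Tensor:
        # View back to the param's shape only when the update is applied.
        return self.data.view(self.shape)

p = torch.randn(4, 8)
exp_avg = FlatExpAvg(p)
p.add_(exp_avg.as_param_shape(), alpha=-1e-3)  # toy update using the viewed state
```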
[low-bit optim] Fix Adam4bit support on PyTorch 2.3 and 2.4. Update AdamFp8 torch requirement (pytorch#755)
* update doc on torch version
* update doc
* update
* fix 4-bit problem
* update doc
* update
See #744 (comment)