
Update real_accelerator.py #6845

Merged
merged 8 commits into from
Dec 14, 2024

Conversation

keiwoo
Contributor

@keiwoo keiwoo commented Dec 10, 2024

Comment out or delete the `accelerator_name = "cpu"` assignment when xpu is not detected.

When `DS_ACCELERATOR` is set, the xpu branch at lines 68 to 74 is skipped entirely. However, when `DS_ACCELERATOR` is not set, `"cpu"` is assigned to `accelerator_name` at lines 125 to 133 whenever `intel_extension_for_pytorch` can be imported but no xpu is found.

I ran into this problem yesterday and spent a whole afternoon figuring it out. `intel_extension_for_pytorch` had been installed as a dependency of another package that I do not actually use, so I had no idea it was present. I then found that `"cpu"` is assigned to `accelerator_name` directly when no xpu is found, and that this short-circuits cuda detection. Without that early assignment, `"cpu"` would only be assigned after cuda detection also fails, at lines 170 to 177.
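For context, the auto-detection branch in question can be sketched as follows (a paraphrase, not the verbatim real_accelerator.py source; the function name is mine):

```python
# Paraphrased sketch of the problematic auto-detection branch in
# real_accelerator.py (not the verbatim source). When DS_ACCELERATOR is
# unset, merely having intel_extension_for_pytorch importable decides
# the outcome, even on a machine whose real accelerator is cuda.
def detect_accelerator_sketch():
    accelerator_name = None
    try:
        import intel_extension_for_pytorch as ipex

        if ipex._C._has_xpu():
            accelerator_name = "xpu"
        else:
            # Problem: assigning "cpu" here skips the mps/cuda checks
            # further down, which only run while accelerator_name is
            # still None.
            accelerator_name = "cpu"
    except ImportError:
        pass
    return accelerator_name
```

On a cuda machine that happens to have the extension installed, this returns "cpu" before the cuda check is ever reached, which is the behavior the PR removes.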

@loadams loadams requested a review from delock December 10, 2024 16:29
@loadams
Contributor

loadams commented Dec 10, 2024

Hi @keiwoo - the goal of this file is to detect which accelerator you have, unless you set it with the DS_ACCELERATOR environment variable. These lines are only executed if this is false: ipex._C._has_xpu() - in that case, if the user does have intel_extension_for_pytorch, but no XPU listed, what accelerator do we have?

Could you clarify whether you had intel_extension_for_pytorch installed when you first ran this, and what the other package was?

Tagging @Liangliang-Ma from the XPU team as well.

@loadams loadams self-requested a review December 10, 2024 16:43
@keiwoo
Contributor Author

keiwoo commented Dec 11, 2024

Hey @loadams, thanks for your review. I fully understand the goal of this file; setting the DS_ACCELERATOR environment variable simply skips the detection logic. You asked:

if the user does have intel_extension_for_pytorch, but no XPU listed, what accelerator do we have?

I suggest we do nothing in that branch and leave accelerator_name as None, since the other accelerators are detected below. Lines 141 to 149 show this:

        if accelerator_name is None:
            try:
                import torch.mps

                # should use torch.mps.is_available() if it exists someday but this is used as proxy
                torch.mps.current_allocated_memory()
                accelerator_name = "mps"
            except (RuntimeError, ImportError) as e:
                pass

In this case we will always have torch installed, of course. But what accelerator do we have when torch.mps.current_allocated_memory() raises an error? We simply pass, and accelerator_name stays None until the code reaches line 170 to detect cuda. As you can see, if no accelerator is detected at all, line 177 finally assigns accelerator_name = "cpu", as I mentioned before.

                if torch.cuda.is_available():  #ignore-cuda
                    accelerator_name = "cuda"
                else:
                    if accel_logger is not None:
                        accel_logger.warn(
                            "Setting accelerator to CPU. If you have GPU or other accelerator, we were unable to detect it."
                        )
                    accelerator_name = "cpu"

Hope everything is explained clearly. I will also check which package installed intel_extension_for_pytorch alongside it.

@keiwoo
Contributor Author

keiwoo commented Dec 11, 2024

I found the reason why intel_extension_for_pytorch was installed: it is a dependency for compiling bitsandbytes from source. link with highlight

@loadams
Contributor

loadams commented Dec 11, 2024

I found the reason why intel_extension_for_pytorch was installed. It is the dependency for compiling bitsandbytes from source. link with highlight

Hi @keiwoo - I see, I misunderstood and thought you were using an XPU but were having issues detecting it. Instead, you are using another accelerator, and because intel_extension_for_pytorch is installed you're getting into this part of the file when that's undesirable - is that correct?

@tjruwase
Contributor

@keiwoo, thanks for your work here. I agree with avoiding "cpu" as a fallback for a specific accelerator. Although your PR addresses the xpu case, I think the cuda case should also be removed, and the selection of "cpu" added as a catch-all for when accelerator detection fails (i.e., accelerator_name == None), around here.

What do you think?
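A minimal sketch of that shape (a hypothetical helper, not DeepSpeed's actual API): each detector only ever claims its own accelerator, and "cpu" is chosen once, at the end, when every probe has failed.

```python
# Hypothetical restructure in the spirit of the suggestion above: no
# per-accelerator "cpu" fallback; "cpu" is a single catch-all used only
# when every detection probe fails.
def pick_accelerator(detectors):
    """detectors: ordered (name, probe) pairs; probe() returns True
    when that accelerator is present and usable."""
    for name, probe in detectors:
        try:
            if probe():
                return name
        except Exception:
            pass  # a failing probe just means "not this one"
    return "cpu"  # catch-all when all detection fails

# e.g. with every probe failing, detection falls through to cpu:
# pick_accelerator([("xpu", lambda: False), ("cuda", lambda: False)])
```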

Avoid using cpu as fallback for a specific accelerator and the selection of cpu added as catch-all  when accelerator detection fails
@keiwoo
Contributor Author

keiwoo commented Dec 12, 2024

@microsoft-github-policy-service agree

Contributor Author

@keiwoo keiwoo left a comment


Agree with you, @tjruwase. That would be clearer, and I have made some revisions. How about that?

@delock
Collaborator

delock commented Dec 13, 2024

@keiwoo Thanks for this PR. Yes, I think it makes sense to select cpu only when no other accelerator can be found, and your PR makes this intention clear. Currently, accelerator selection relies on four different hints:

  1. Existence of an extension suggests using that accelerator (npu, hpu, mps)
  2. Existence of an extension plus device detection suggests using that accelerator (xpu)
  3. No extension needed, device detection alone (cuda)
  4. Use this accelerator when no other accelerator is selected (cpu)

I think eventually all accelerators may need device detection to simplify environment management in hybrid clouds; the change to cpu detection in this PR conforms with this goal.
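Hint styles 1 and 2 both begin with an importability check. A tiny illustration (the helper and the example module names are mine, not DeepSpeed code):

```python
import importlib.util

# Styles 1 and 2 both start from "is the extension importable?". Style 1
# stops there; style 2 then probes for an actual device on top of it;
# style 3 needs no extension at all; style 4 (cpu) is the last-resort
# default when nothing else matched.
def extension_present(module_name):
    # find_spec checks importability without actually importing the
    # (potentially heavy) extension module.
    return importlib.util.find_spec(module_name) is not None

# e.g. extension_present("torch_npu") for npu, or
# extension_present("intel_extension_for_pytorch") for xpu.
```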

Collaborator

@delock delock left a comment


Looks good, thanks!

@loadams loadams merged commit fc7c070 into microsoft:master Dec 14, 2024
11 checks passed
loadams added a commit that referenced this pull request Dec 17, 2024
loadams added a commit that referenced this pull request Dec 18, 2024
…or whl building) (#6886)

This fixes a bug introduced in #6845, which breaks the `no-torch`
workflow that we require in order to do releases where we do not require
torch to be in the environment when building an sdist. This adds the
same logic to the cpuaccelerator that the cudaaccelerator had where we
don't require torch to be installed to build the whl.