
fix import flashinfer error on AMD GPUs #148

Closed

akaitsuki-ii wants to merge 1 commit into feifeibear:main from akaitsuki-ii:fix_import_flashinfer

Conversation

@akaitsuki-ii

Getting the CUDA arch flags raises an error when importing flashinfer on AMD GPUs. Since flashinfer is not officially supported on the AMD platform, just disable it when flashinfer is not installed.

@eppaneamd

eppaneamd commented May 5, 2025

@akaitsuki-ii I think this fix needs to be revisited, due to:

  • When flashinfer is installed on AMD GPUs, HAS_FLASHINFER gets set to True in globals, because e.g. an MI300X returns (9, 4) from torch.cuda.get_device_capability()
    • (TORCH_CUDA_ARCH_LIST is an NVIDIA-specific envvar)
  • The torch_cpp_ext._get_cuda_arch_flags() function is NVIDIA-specific, so it will crash on AMD GPUs when HAS_FLASHINFER = True (a minimal repro sketch follows this list)
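For reference, a minimal repro sketch of that failure mode, assuming a ROCm build of PyTorch on an MI300X and the torch_cpp_ext alias shown here (the exact error depends on the PyTorch version):

import torch
from torch.utils import cpp_extension as torch_cpp_ext

# A ROCm build still reports a device capability, e.g. (9, 4) on MI300X,
# so a capability-based check alone does not rule out AMD GPUs.
print(torch.version.hip)                   # a version string on ROCm, None on CUDA
print(torch.cuda.get_device_capability())  # e.g. (9, 4) on MI300X

# _get_cuda_arch_flags() only understands NVIDIA arch names ("8.0", "9.0", ...),
# so an unrecognized arch such as "9.4" makes it raise, as described above.
torch_cpp_ext._get_cuda_arch_flags()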

If disabling flashinfer is desired on AMD GPUs, the HAS_FLASHINFER check in globals could be augmented with a torch.version.hip check, something like:

import os

import torch

try:
    from flashinfer.prefill import single_prefill_with_kv_cache

    # flashinfer has no official ROCm support, so treat it as unavailable on AMD GPUs
    if torch.version.hip:
        raise ImportError("FlashInfer not supported on AMD GPUs")

    HAS_FLASHINFER = True

    def get_cuda_arch():
        major, minor = torch.cuda.get_device_capability()
        return f"{major}.{minor}"

    cuda_arch = get_cuda_arch()
    os.environ['TORCH_CUDA_ARCH_LIST'] = cuda_arch
    print(f"Set TORCH_CUDA_ARCH_LIST to {cuda_arch}")
except ImportError as e:
    print("Warning:", type(e).__name__, "-", e)
    HAS_FLASHINFER = False

And the suggested check

if HAS_FLASHINFER:
    torch_cpp_ext._get_cuda_arch_flags()

can be added/kept (the purpose of that function call here is to raise an error if an unknown CUDA arch is detected).
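Putting the two pieces together, a minimal end-to-end sketch under the same assumptions (the torch_cpp_ext alias is illustrative, not necessarily what the repo uses):

import os

import torch
from torch.utils import cpp_extension as torch_cpp_ext

try:
    from flashinfer.prefill import single_prefill_with_kv_cache
    if torch.version.hip:  # ROCm build: treat flashinfer as unavailable
        raise ImportError("FlashInfer not supported on AMD GPUs")
    HAS_FLASHINFER = True
    # (TORCH_CUDA_ARCH_LIST setup from the snippet above elided for brevity)
except ImportError as e:
    print("Warning:", type(e).__name__, "-", e)
    HAS_FLASHINFER = False

if HAS_FLASHINFER:
    # Fails fast here if an unknown CUDA arch is detected.
    torch_cpp_ext._get_cuda_arch_flags()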

@feifeibear
Owner

Hi, does PR #150 solve the problem?

@akaitsuki-ii
Author

@feifeibear Yes, LGTM. Thank you and let me close this PR.
