Cherry-pick "[RUNTIME] Implement dynamic loading with defineGetFunctionHandle for CUDA version compatibility (#2771)" by davidberard98 · Pull Request #2789 · triton-lang/triton

davidberard98 · 2023-12-12T21:53:09Z

This is needed for CUDA 11 support, which we'd like to have in the PyTorch 2.2 release.

Original commit message:

In case cuda 11 drivers are still used on some systems, we shouldn't call TMA and block cluster related functions directly. Instead, we can dynamically lookup the handles to avoid compatibility issues.

…onHandle for CUDA version compatibility (triton-lang#2771)" This is needed for CUDA 11 support, which we'd like to have in the PyTorch 2.2 release. Original commit message: In case cuda 11 drivers are still used on some systems, we shouldn't call TMA and block cluster related functions directly. Instead, we can dynamically lookup the handles to avoid compatibility issues.

…onHandle for CUDA version compatibility (triton-lang#2771)" (triton-lang#2789) This is needed for CUDA 11 support, which we'd like to have in the PyTorch 2.2 release. Original commit message: In case cuda 11 drivers are still used on some systems, we shouldn't call TMA and block cluster related functions directly. Instead, we can dynamically lookup the handles to avoid compatibility issues. Co-authored-by: Keren Zhou <kerenzhou@openai.com>

davidberard98 marked this pull request as ready for review December 12, 2023 21:53

davidberard98 requested a review from ptillet as a code owner December 12, 2023 21:53

jlebar approved these changes Dec 12, 2023

View reviewed changes

atalman approved these changes Dec 12, 2023

View reviewed changes

malfet approved these changes Dec 13, 2023

View reviewed changes

malfet merged commit e28a256 into triton-lang:release/2.2.x Dec 13, 2023

atalman mentioned this pull request May 14, 2024

Version: 2.3.1 #3912

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cherry-pick "[RUNTIME] Implement dynamic loading with defineGetFunctionHandle for CUDA version compatibility (#2771)"#2789

Cherry-pick "[RUNTIME] Implement dynamic loading with defineGetFunctionHandle for CUDA version compatibility (#2771)"#2789
malfet merged 1 commit intotriton-lang:release/2.2.xfrom
davidberard98:cherry-pick-2771

davidberard98 commented Dec 12, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

davidberard98 commented Dec 12, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants