Skip to content

[GSan] Partially support TMA & cp.async ops#9699

Merged
peterbell10 merged 2 commits into
mainfrom
pb/pr-chain/gsan_partially_support_tma_ops_d1c1
Mar 20, 2026
Merged

[GSan] Partially support TMA & cp.async ops#9699
peterbell10 merged 2 commits into
mainfrom
pb/pr-chain/gsan_partially_support_tma_ops_d1c1

Conversation

@peterbell10
Copy link
Copy Markdown
Contributor

@peterbell10 peterbell10 commented Mar 12, 2026

Commits in this PR

  1. [GSan] Partially support TMA ops

    This handles TMA ops by decoding the TMA descriptor, and converting the
    calls into the equivalent load/store calls which we pass to the
    load/store instrumentation.

    This is incomplete because it doesn't model the asynchrony, but is a
    reasonable starting point.

  2. Also partially support async_copy

PR chain

  1. 👉 [GSan] Partially support TMA & cp.async ops #9699 👈 YOU ARE HERE
  2. [GSan] Add symmetric memory API #9493
  3. [GSan] Support atomics #9700

@peterbell10 peterbell10 requested a review from ptillet as a code owner March 12, 2026 14:35
@peterbell10 peterbell10 marked this pull request as draft March 12, 2026 14:36
@peterbell10 peterbell10 changed the title [GSan] Partially support TMA ops [GSan] Partially support TMA & cp.async ops Mar 12, 2026
@peterbell10 peterbell10 force-pushed the pb/pr-chain/gsan_add_symmetric_memory_api_a42c branch from bcb2ec6 to e8cf018 Compare March 12, 2026 21:08
@peterbell10 peterbell10 force-pushed the pb/pr-chain/gsan_partially_support_tma_ops_d1c1 branch from e567d06 to 09c2e83 Compare March 12, 2026 21:08
@peterbell10 peterbell10 force-pushed the pb/pr-chain/gsan_add_symmetric_memory_api_a42c branch from e8cf018 to 372fe1e Compare March 12, 2026 21:17
@peterbell10 peterbell10 force-pushed the pb/pr-chain/gsan_partially_support_tma_ops_d1c1 branch from 09c2e83 to eae198e Compare March 12, 2026 21:17
@peterbell10 peterbell10 force-pushed the pb/pr-chain/gsan_add_symmetric_memory_api_a42c branch from 372fe1e to 7138e28 Compare March 13, 2026 12:24
@peterbell10 peterbell10 force-pushed the pb/pr-chain/gsan_partially_support_tma_ops_d1c1 branch from eae198e to c5c7aad Compare March 13, 2026 12:24
@peterbell10 peterbell10 force-pushed the pb/pr-chain/gsan_add_symmetric_memory_api_a42c branch from 7138e28 to d93ee69 Compare March 13, 2026 12:27
@peterbell10 peterbell10 force-pushed the pb/pr-chain/gsan_partially_support_tma_ops_d1c1 branch from c5c7aad to 0734117 Compare March 13, 2026 12:27
@peterbell10 peterbell10 force-pushed the pb/pr-chain/gsan_add_symmetric_memory_api_a42c branch from d93ee69 to 9239245 Compare March 13, 2026 12:50
@peterbell10 peterbell10 force-pushed the pb/pr-chain/gsan_partially_support_tma_ops_d1c1 branch from 0734117 to 3570ff8 Compare March 13, 2026 12:50
@peterbell10 peterbell10 force-pushed the pb/pr-chain/gsan_add_symmetric_memory_api_a42c branch from 9239245 to cac3d76 Compare March 13, 2026 22:27
@peterbell10 peterbell10 force-pushed the pb/pr-chain/gsan_partially_support_tma_ops_d1c1 branch from 3570ff8 to b606ba9 Compare March 13, 2026 22:27
@peterbell10 peterbell10 force-pushed the pb/pr-chain/gsan_add_symmetric_memory_api_a42c branch from cac3d76 to cf31648 Compare March 13, 2026 23:46
@peterbell10 peterbell10 force-pushed the pb/pr-chain/gsan_partially_support_tma_ops_d1c1 branch from b606ba9 to 90696f9 Compare March 13, 2026 23:46
@peterbell10 peterbell10 changed the base branch from pb/pr-chain/gsan_add_symmetric_memory_api_a42c to main March 16, 2026 20:30
@peterbell10 peterbell10 force-pushed the pb/pr-chain/gsan_partially_support_tma_ops_d1c1 branch from 90696f9 to 870529e Compare March 16, 2026 20:30
@peterbell10 peterbell10 changed the base branch from main to pb/pr-chain/gsan_instrument_tl_load_store_cacf March 16, 2026 20:31
@peterbell10 peterbell10 force-pushed the pb/pr-chain/gsan_partially_support_tma_ops_d1c1 branch 3 times, most recently from ea94e1e to 0e0b801 Compare March 17, 2026 01:48
@peterbell10 peterbell10 marked this pull request as ready for review March 17, 2026 11:49
@peterbell10 peterbell10 force-pushed the pb/pr-chain/gsan_instrument_tl_load_store_cacf branch from 88e7366 to c7cd539 Compare March 19, 2026 10:54
@peterbell10 peterbell10 force-pushed the pb/pr-chain/gsan_partially_support_tma_ops_d1c1 branch from 0e0b801 to 2db13d1 Compare March 19, 2026 10:54
@peterbell10 peterbell10 force-pushed the pb/pr-chain/gsan_instrument_tl_load_store_cacf branch from c7cd539 to a2fad55 Compare March 19, 2026 19:31
@peterbell10 peterbell10 force-pushed the pb/pr-chain/gsan_partially_support_tma_ops_d1c1 branch from 2db13d1 to 52485af Compare March 19, 2026 19:31
@peterbell10 peterbell10 force-pushed the pb/pr-chain/gsan_instrument_tl_load_store_cacf branch from a2fad55 to 8ebfa99 Compare March 19, 2026 20:24
@peterbell10 peterbell10 force-pushed the pb/pr-chain/gsan_partially_support_tma_ops_d1c1 branch from 52485af to 7a710ac Compare March 19, 2026 20:24
Base automatically changed from pb/pr-chain/gsan_instrument_tl_load_store_cacf to main March 19, 2026 22:14
This handles TMA ops by decoding the TMA descriptor, and converting the
calls into the equivalent load/store calls which we pass to the
load/store instrumentation.

This is incomplete because it doesn't model the asynchrony, but is a
reasonable starting point.

git-pr-chain: gsan_partially_support_tma_ops_d1c1
@peterbell10 peterbell10 force-pushed the pb/pr-chain/gsan_partially_support_tma_ops_d1c1 branch from 7a710ac to 3b193b3 Compare March 20, 2026 12:31
@peterbell10 peterbell10 enabled auto-merge (squash) March 20, 2026 12:33
@peterbell10 peterbell10 merged commit bf56c92 into main Mar 20, 2026
25 of 27 checks passed
@peterbell10 peterbell10 deleted the pb/pr-chain/gsan_partially_support_tma_ops_d1c1 branch March 20, 2026 20:24
raymondtay pushed a commit to raymondtay/triton that referenced this pull request Mar 22, 2026
This handles TMA ops by decoding the TMA descriptor, and converting the
calls into the equivalent load/store calls which we pass to the
load/store instrumentation.
    
This is incomplete because it doesn't model the asynchrony, but is a
reasonable starting point.
jvican pushed a commit to jvican/triton that referenced this pull request Mar 27, 2026
This handles TMA ops by decoding the TMA descriptor, and converting the
calls into the equivalent load/store calls which we pass to the
load/store instrumentation.
    
This is incomplete because it doesn't model the asynchrony, but is a
reasonable starting point.
plognjen pushed a commit to plognjen/triton that referenced this pull request Apr 14, 2026
This handles TMA ops by decoding the TMA descriptor, and converting the
calls into the equivalent load/store calls which we pass to the
load/store instrumentation.
    
This is incomplete because it doesn't model the asynchrony, but is a
reasonable starting point.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants