
[BUG] Fix dsa_sparse_finetune/sparse_mla_bwd.py bug #1588

Merged

LeiWang1999 merged 2 commits into tile-ai:main from xiuhu17:dev_ on Jan 18, 2026

Conversation

@xiuhu17 (Contributor) commented Dec 31, 2025

#1558

Summary by CodeRabbit

  • Refactor
  • Updated the backward kernel's atomic accumulation to use single-element atomic updates, changing the data access pattern and loop structure of the gradient accumulation.


coderabbitai bot (Contributor) commented Dec 31, 2025

📝 Walkthrough

Replaces vectorized 4-wide atomic operations with single-element atomic updates in the sparse MLA backward kernel. The loop structure changes from iterating over (BS // split_store, D // 4) to (BS // split_store, D), and the index mapping shifts from block-based (d_i * 4) to element-wise (d_i) addressing, modifying the atomic accumulation scheme for the gradient computation.

Changes

Cohort / File(s): Sparse MLA Backward Kernel Atomics (examples/dsa_sparse_finetune/sparse_mla_bwd.py)
Summary: Changed the atomic accumulation from vectorized 4-wide block updates (atomic_addx4) to single-element updates (atomic_add); adjusted loop bounds from D // 4 to D and D_tail // 4 to D_tail; updated the index mapping from d_i * 4 to d_i for the dKV and dKV_tail gradient accumulators (see the sketch below).
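
The following is a rough before/after sketch of the change described above, not the file's actual code. It assumes TileLang-style T.Parallel loops inside the kernel body (with import tilelang.language as T), and the buffer and index names (dKV, dkv_shared, kv_base, bs_i, d_i) are illustrative placeholders inferred from the summary.

```python
# Illustrative sketch only, not the verbatim diff; names are assumptions.

# Before: 4-wide vectorized atomics over D // 4 column blocks
for bs_i, d_i in T.Parallel(BS // split_store, D // 4):
    T.atomic_addx4(dKV[kv_base + bs_i, d_i * 4], dkv_shared[bs_i, d_i * 4])

# After: single-element atomics over all D columns
for bs_i, d_i in T.Parallel(BS // split_store, D):
    T.atomic_add(dKV[kv_base + bs_i, d_i], dkv_shared[bs_i, d_i])
```

The per-element form sidesteps the alignment assumptions of the 4-wide variant, which the discussion below suspects interact badly with thd-format padding.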

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~25 minutes

Possibly related PRs

Suggested reviewers

  • LeiWang1999

Poem

🐰 Hop, hop—those atomics dance,
Four-wide blocks take their last prance,
Now singles flutter, swift and lean,
The finest gradients I've ever seen!

Pre-merge checks and finishing touches

❌ Failed checks (1 warning)
  • Docstring Coverage: ⚠️ Warning. Docstring coverage is 0.00%, below the required threshold of 80.00%. Run @coderabbitai generate docstrings to improve coverage.

✅ Passed checks (2 passed)
  • Description Check: ✅ Passed. Check skipped because CodeRabbit's high-level summary is enabled.
  • Title Check: ✅ Passed. The title accurately identifies the specific file and the nature of the change, clearly indicating a bug fix in sparse_mla_bwd.py.

✨ Finishing touches
  • 📝 Generate docstrings

📜 Recent review details

Configuration used: defaults

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between dcacc5a and 6fbbed1.

📒 Files selected for processing (1)
  • examples/dsa_sparse_finetune/sparse_mla_bwd.py
🔇 Additional comments (2)
examples/dsa_sparse_finetune/sparse_mla_bwd.py (2)

229-233: LGTM!

The fix correctly replaces the 4-wide vectorized atomic with single-element atomic updates. The loop bounds (BS // split_store, D) match the shared memory view shape, and the indexing is consistent with the copy operation at lines 221-223.
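
A minimal sketch of the pattern this comment refers to, under the assumption that the block-local partial gradient is staged in shared memory before accumulation. The names dkv_accum, dkv_shared, and kv_base are hypothetical, not the file's identifiers, and the fragment would sit inside the kernel body.

```python
# Hypothetical names; shapes follow those quoted in the review.
T.copy(dkv_accum, dkv_shared)  # stage the block-local partial dKV gradient in shared memory
for bs_i, d_i in T.Parallel(BS // split_store, D):
    # element-wise atomic accumulation into the global dKV buffer
    T.atomic_add(dKV[kv_base + bs_i, d_i], dkv_shared[bs_i, d_i])
```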


236-240: LGTM!

The tail dimension atomic update mirrors the main dimension fix correctly. Loop bounds (BS // split_store, D_tail) match the shared view, and the D + d_i offset correctly addresses the tail portion of dKV.
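
A matching sketch for the tail dimension, again with hypothetical names; the only differences from the main-dimension loop are the D_tail bound and the D + d_i column offset.

```python
# Hypothetical names; D + d_i addresses the tail columns that follow the
# first D columns of dKV, matching the offset described in this comment.
for bs_i, d_i in T.Parallel(BS // split_store, D_tail):
    T.atomic_add(dKV[kv_base + bs_i, D + d_i], dkv_tail_shared[bs_i, d_i])
```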



@github-actions

👋 Hi! Thank you for contributing to the TileLang project.

Please remember to run pre-commit run --all-files in the root directory of the project to ensure your changes are properly linted and formatted. This will help ensure your contribution passes the format check.

We appreciate you taking this step! Our team will review your contribution, and we look forward to your awesome work! 🚀

@LeiWang1999 (Member)

Surprised to find that atomic_addx4 is buggy here; I'll also take a look :)

@xiuhu17 (Contributor, Author) commented Jan 1, 2026

The x2 variant also seems to cause issues. My guess is that it might be a padding issue related to the thd format. Thanks for commenting.

@LeiWang1999 (Member)

LGTM, and after PR #1677, atomic_add will be automatically lowered into atomic_addx4 where possible :)

@LeiWang1999 LeiWang1999 merged commit bb7f30c into tile-ai:main Jan 18, 2026
2 checks passed
