bugfix: hotfix of PR 2366 (mamba kernel) by yzh119 · Pull Request #2378 · flashinfer-ai/flashinfer

yzh119 · 2026-01-20T08:31:05Z

📌 Description

Commit 1938c5c, which we added to #2366, causes the unit tests to hang: the FLASHINFER_CUDA_CALL macro expects its enclosing function to return a cudaError_t, which conflicts with the structure of the lambda functions inside invokeSelectiveStateUpdate in the mamba kernel. This PR fixes the issue by adding another macro, FLASHINFER_CUDA_CHECK.
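To make the mismatch concrete, here is a minimal sketch in plain C++ so that it compiles without the CUDA toolkit. Everything in it is a hypothetical stand-in: `FakeError` plays the role of `cudaError_t`, `fake_set_attribute` the role of a runtime call such as `cudaFuncSetAttribute`, and `SKETCH_CALL` is an assumed early-return macro, not a copy of `FLASHINFER_CUDA_CALL`. It shows only one way such a macro can misbehave inside a lambda; it does not claim to reproduce the hang itself.

```cpp
// Hypothetical stand-ins: none of these names come from FlashInfer or CUDA.
enum FakeError { kSuccess = 0, kInvalidValue = 1 };

// Stand-in for a runtime call that can fail.
inline FakeError fake_set_attribute(int smem_bytes) {
  return smem_bytes >= 0 ? kSuccess : kInvalidValue;
}

// Assumed early-return pattern: on failure, `return e_;` executes in whatever
// function or lambda the macro is expanded in.
#define SKETCH_CALL(expr)          \
  do {                             \
    FakeError e_ = (expr);         \
    if (e_ != kSuccess) return e_; \
  } while (0)

// Fits: the enclosing function itself returns the error type, so the caller
// sees the failure.
inline FakeError launch_ok(int smem_bytes) {
  SKETCH_CALL(fake_set_attribute(smem_bytes));
  return kSuccess;
}

// Mismatch: inside a lambda, the macro's `return` only leaves the lambda.
// If the result is discarded, the outer function never learns about the error.
inline bool launch_dropping_error(int smem_bytes) {
  bool kernel_launched = false;
  auto dispatch = [&]() -> FakeError {
    SKETCH_CALL(fake_set_attribute(smem_bytes));
    kernel_launched = true;  // skipped when the macro returns early
    return kSuccess;
  };
  dispatch();  // error code dropped here
  return kernel_launched;
}
```

In this sketch, `launch_ok(-1)` surfaces `kInvalidValue` to its caller, while `launch_dropping_error(-1)` silently skips the launch step and reports nothing about why.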

🔍 Related Issues

#2366

🚀 Pull Request Checklist

Thank you for contributing to FlashInfer! Before we review your pull request, please make sure the following items are complete.

✅ Pre-commit Checks

I have installed pre-commit by running pip install pre-commit (or used your preferred method).
I have installed the hooks with pre-commit install.
I have run the hooks manually with pre-commit run --all-files and fixed any reported issues.

If you are unsure about how to set up pre-commit, see the pre-commit documentation.

🧪 Tests

Tests have been added or updated as needed.
All tests are passing (unittest, etc.).

Reviewer Notes

Summary by CodeRabbit

Refactor
- Improved CUDA error handling mechanisms to provide clearer error messages and enhanced diagnostics during kernel configuration and execution.


yzh119 · 2026-01-20T08:31:15Z

/bot run

coderabbitai · 2026-01-20T08:31:17Z

📝 Walkthrough


A new CUDA error-checking macro FLASHINFER_CUDA_CHECK is introduced in the utilities header, which validates CUDA function results and formats error messages with context. Two existing error-check invocations in the selective state update kernel are updated to use this new macro.

Changes

| Cohort / File(s) | Summary |
|---|---|
| CUDA Error Handling Macro Definition: `include/flashinfer/utils.cuh` | Adds `FLASHINFER_CUDA_CHECK(func)` macro that executes a CUDA function and validates the result equals `cudaSuccess`, emitting formatted error messages with CUDA error string, code, file, line, and function text. |
| Macro Usage in Selective State Update: `include/flashinfer/mamba/selective_state_update.cuh` | Updates two instances of `cudaFuncSetAttribute` calls to use the new `FLASHINFER_CUDA_CHECK` macro instead of `FLASHINFER_CUDA_CALL` for consistent error handling. |
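The check-style macro summarized in the table can be sketched as follows, again in plain C++ with hypothetical stand-ins (`FakeError`, `fake_error_string`, `fake_set_attribute`, `SKETCH_CHECK`). The summary only says the macro emits a formatted message; reporting the failure by throwing `std::runtime_error` is an assumption of this sketch, and the real `FLASHINFER_CUDA_CHECK` may format or report differently.

```cpp
#include <sstream>
#include <stdexcept>
#include <string>

// Hypothetical stand-ins: none of these names come from FlashInfer or CUDA.
enum FakeError { kSuccess = 0, kInvalidValue = 1 };

// Stand-in for cudaGetErrorString.
inline const char* fake_error_string(FakeError e) {
  return e == kSuccess ? "no error" : "invalid argument";
}

// Stand-in for a runtime call that can fail.
inline FakeError fake_set_attribute(int smem_bytes) {
  return smem_bytes >= 0 ? kSuccess : kInvalidValue;
}

// Check style: no `return`, so the macro works in any scope, including a
// lambda that returns void. Failure is reported by throwing (an assumption).
#define SKETCH_CHECK(expr)                                          \
  do {                                                              \
    FakeError e_ = (expr);                                          \
    if (e_ != kSuccess) {                                           \
      std::ostringstream oss_;                                      \
      oss_ << "error: " << fake_error_string(e_) << " ("            \
           << static_cast<int>(e_) << ") " << __FILE__ << ":"       \
           << __LINE__ << " '" << #expr << "'";                     \
      throw std::runtime_error(oss_.str());                         \
    }                                                               \
  } while (0)

// The lambda returns void, which an early-return macro could not support.
inline bool configure_in_void_lambda(int smem_bytes) {
  bool configured = false;
  auto configure = [&]() {
    SKETCH_CHECK(fake_set_attribute(smem_bytes));
    configured = true;
  };
  configure();
  return configured;
}
```

Because the macro never issues a `return`, its behavior does not depend on the return type of the scope it is expanded in; the trade-off in this sketch is that callers must be prepared for an exception rather than an error code.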

Estimated code review effort

🎯 2 (Simple) | ⏱️ ~10 minutes

Poem

🐰 A macro hops in to check CUDA's way,
Validating success throughout the day,
Error messages clear in the morning light,
Two kernels now check in a unified might!
Better error handling, what a delight!

🚥 Pre-merge checks | ✅ 3

✅ Passed checks (3 passed)

| Check name | Status | Explanation |
|---|---|---|
| Title check | ✅ Passed | The title clearly and specifically describes the main change: fixing a hotfix related to PR 2366 affecting the mamba kernel. |
| Description check | ✅ Passed | The description follows the template structure with a clear explanation of the issue, related issues linked, and pre-commit/test checklist items addressed. |
| Docstring Coverage | ✅ Passed | No functions found in the changed files to evaluate docstring coverage. Skipping docstring coverage check. |


flashinfer-bot · 2026-01-20T08:31:49Z

GitLab MR !249 has been created, and the CI pipeline #42094754 is currently running. I'll report back once the pipeline job completes.

flashinfer-bot · 2026-01-20T22:17:35Z

[FAILED] Pipeline #42094754: 10/20 passed

yzh119 · 2026-01-20T22:43:19Z

The failed UTs are unrelated to this change; this should be ready to merge.

upd

a5dd120

yzh119 requested review from IwakuraRein, jiahanc and kahyunnam as code owners January 20, 2026 08:31

yzh119 mentioned this pull request Jan 20, 2026

feat: [Qwen3-Next] Add Cute DSL GDN decode kernel and tests #2370

Merged

cyx-6 approved these changes Jan 20, 2026


yzh119 added the v0.6.2 label Jan 20, 2026

ishovkun mentioned this pull request Jan 20, 2026

A Blackwell-optimized version of selective_state_update (mamba) #2387

Merged

5 tasks

yzh119 merged commit 386bc77 into flashinfer-ai:main Jan 20, 2026
26 of 38 checks passed

yzh119 merged 1 commit into flashinfer-ai:main from yzh119:hotfix-2366
