bugfix: hotfix of PR 2366 (mamba kernel)#2378
Conversation
|
/bot run |
📝 WalkthroughWalkthroughA new CUDA error-checking macro Changes
Estimated code review effort🎯 2 (Simple) | ⏱️ ~10 minutes Poem
🚥 Pre-merge checks | ✅ 3✅ Passed checks (3 passed)
✏️ Tip: You can configure your own custom pre-merge checks in the settings. Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out. Comment |
|
[FAILED] Pipeline #42094754: 10/20 passed |
|
Failed UTs are not relevant, should be ready to merge. |
📌 Description
The commit 1938c5c we added to #2366 cause hanging issue when running the unittests because the
FLASHINFER_CUDA_CALLmacro expects the caller environment returns acudaError_twhich contradicts with the structure of the lambda functions inside invokeSelectiveStateUpdate of mamba kernel. This PR fixes the issue by adding another macroFLASHINFER_CUDA_CHECK.🔍 Related Issues
#2366
🚀 Pull Request Checklist
Thank you for contributing to FlashInfer! Before we review your pull request, please make sure the following items are complete.
✅ Pre-commit Checks
pre-commitby runningpip install pre-commit(or used your preferred method).pre-commit install.pre-commit run --all-filesand fixed any reported issues.🧪 Tests
unittest, etc.).Reviewer Notes
Summary by CodeRabbit
✏️ Tip: You can customize this high-level summary in your review settings.