Skip to content

Flashinfer fused MoE and flashattn v2/v3 all-in-one#31

Merged
guoqingbao merged 8 commits intomainfrom
flashinfer_moe
Mar 11, 2026
Merged

Flashinfer fused MoE and flashattn v2/v3 all-in-one#31
guoqingbao merged 8 commits intomainfrom
flashinfer_moe

Conversation

@guoqingbao
Copy link
Copy Markdown
Owner

No description provided.

Copy link
Copy Markdown

@chatgpt-codex-connector chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: d4de938bdc

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

  • Open a pull request for review
  • Mark a draft as ready
  • Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

Comment thread src/lib.rs Outdated
Comment thread src/lib.rs Outdated
@sempervictus
Copy link
Copy Markdown
Contributor

@guoqingbao are all the feature flags consolidating to flashinfer or is the thought to have impl determined at compile time and only build what we have to for thetargeg tuple?

@guoqingbao
Copy link
Copy Markdown
Owner Author

@guoqingbao are all the feature flags consolidating to flashinfer or is the thought to have impl determined at compile time and only build what we have to for thetargeg tuple?

Kernels are compiled under specific features (the all-in-one flashattn.rs is used to replace candle-flash-attn, which was controlled by the flash-attn and flash-context features), while flashinfer controls the FlashInfer attention and FlashInfer fused MoE kernel path.

@guoqingbao guoqingbao merged commit c5f0de5 into main Mar 11, 2026
1 check passed
@guoqingbao guoqingbao deleted the flashinfer_moe branch April 16, 2026 09:35
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants