Flashinfer fused MoE and flashattn v2/v3 all-in-one #31
Conversation
💡 Codex Review
Here are some automated review suggestions for this pull request.
Reviewed commit: d4de938bdc
@guoqingbao are all the feature flags consolidating to flashinfer, or is the thought to have the impl determined at compile time and only build what we have to for the target tuple?
Kernels are compiled under specific features: the all-in-one flashattn.rs replaces candle-flash-attn (which was controlled by the flash-attn and flash-context features), while the flashinfer feature controls the FlashInfer attention and FlashInfer fused MoE kernel paths. See the sketch below.
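To illustrate the compile-time gating described above, here's a minimal sketch. The feature names (`flashinfer`, `flash-attn`, `flash-context`) come from this thread; the module and function names are hypothetical placeholders, not the crate's actual layout:

```rust
// Minimal sketch (hypothetical names) of gating kernel backends behind
// Cargo features, so only the needed kernels are compiled for a build.

// FlashInfer attention + fused-MoE path: compiled only with `flashinfer`.
#[cfg(feature = "flashinfer")]
mod flashinfer_backend {}

// All-in-one flashattn.rs path replacing candle-flash-attn, gated by the
// `flash-attn` / `flash-context` features.
#[cfg(any(feature = "flash-attn", feature = "flash-context"))]
mod flashattn_backend {}

// Compile-time dispatch: exactly one definition survives cfg evaluation.
#[cfg(feature = "flashinfer")]
pub fn attention_backend() -> &'static str {
    "flashinfer"
}

#[cfg(all(
    not(feature = "flashinfer"),
    any(feature = "flash-attn", feature = "flash-context")
))]
pub fn attention_backend() -> &'static str {
    "flashattn-all-in-one"
}

#[cfg(not(any(
    feature = "flashinfer",
    feature = "flash-attn",
    feature = "flash-context"
)))]
pub fn attention_backend() -> &'static str {
    "fallback"
}
```

Under this kind of layout, `cargo build --features flashinfer` would compile only the FlashInfer path, so the resulting binary carries no kernels it won't use.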