Conversation

@hmellor (Member) commented Nov 3, 2025

N.B. xformers is also causing --pre to be required at the moment

@hmellor hmellor added the ready ONLY add when PR is ready to merge/full CI is needed label Nov 3, 2025
@mergify mergify bot added the ci/build label Nov 3, 2025
@gemini-code-assist bot (Contributor) left a comment

Code Review

This pull request updates Flashinfer from version v0.4.1 to v0.5.0. The changes include updating the package versions in requirements/cuda.txt and the Dockerfiles, as well as removing a related workaround that is no longer necessary. The modifications are consistent and correctly implement the version bump. The changes look good.

@youkaichao (Member) commented

cc @mgoin @pavanimajety

@simon-mo simon-mo added this to the v0.11.1 milestone Nov 3, 2025
@hmellor (Member, Author) commented Nov 3, 2025

Both Blackwell tests passed in last night's nightly, so these appear to be new, legitimate failures.

@lgeiger (Contributor) commented Nov 4, 2025

Shall we check whether v0.5.1 fixes it?

@hmellor (Member, Author) commented Nov 4, 2025

The changelog doesn't suggest any relevant bug fixes. We have a fix for the unquantised test (relaxing the tolerances), but I'm still waiting on a solution for the quantised MoE test.
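For illustration, "relaxing the tolerances" in a numerical test typically means widening the allowed gap between actual and expected outputs. A minimal sketch with hypothetical values (not the actual test or the real deltas):

```python
import math

# Hypothetical sketch: suppose a kernel update shifts an output by ~5e-4.
# A strict absolute tolerance now fails, while a relaxed one passes.
actual, expected = 1.0005, 1.0

strict = math.isclose(actual, expected, rel_tol=0.0, abs_tol=1e-5)
relaxed = math.isclose(actual, expected, rel_tol=0.0, abs_tol=1e-2)

print(strict, relaxed)  # False True
```

The judgment call in a real fix is picking the smallest tolerance that absorbs legitimate numerical drift without masking genuine regressions.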

@hmellor (Member, Author) commented Nov 4, 2025

MoE investigation here: flashinfer-ai/flashinfer#2032

@seindum mentioned this pull request Nov 5, 2025
@hmellor (Member, Author) commented Nov 7, 2025

flashinfer-ai/flashinfer#2049 contains the MoE fix and was included in v0.5.2, released an hour ago.

@hmellor hmellor changed the title Update Flashinfer from v0.4.1 to v0.5.0 Update Flashinfer from v0.4.1 to v0.5.2 Nov 7, 2025
@yewentao256 (Member) left a comment
LGTM, thanks for the work!
Could we trigger the full tests (including all optional) for this change?

@hmellor (Member, Author) commented Nov 7, 2025

Changing the requirements files or Dockerfile already triggers full CI.

Some optional tests only run nightly, but many of those are already failing and would block this PR from merging.

@hmellor hmellor merged commit 811df41 into vllm-project:main Nov 8, 2025
91 checks passed
@hmellor hmellor deleted the flashinfer-update branch November 8, 2025 00:24
@mgoin mgoin added this to NVIDIA Nov 11, 2025
@mgoin mgoin added the nvidia label Nov 11, 2025
@mgoin mgoin moved this to Done in NVIDIA Nov 11, 2025
xuebwang-amd pushed a commit to xuebwang-amd/vllm that referenced this pull request Nov 13, 2025
devpatelio pushed a commit to SumanthRH/vllm that referenced this pull request Nov 29, 2025

Labels

ci/build, nvidia, ready (ONLY add when PR is ready to merge/full CI is needed)

Projects

NVIDIA (Status: Done)

Development

Successfully merging this pull request may close these issues:

[Installation]: FlashInfer Dependency issue due to pre-release apache-tvm-ffi

6 participants