Skip to content

[CI]: remove unused FLASHINFER_AOT_COMPILE build argument#32627

Closed
haitwang-cloud wants to merge 1 commit into
vllm-project:mainfrom
haitwang-cloud:cleanup/remove-unused-flashinfer-aot-compile-arg
Closed

[CI]: remove unused FLASHINFER_AOT_COMPILE build argument#32627
haitwang-cloud wants to merge 1 commit into
vllm-project:mainfrom
haitwang-cloud:cleanup/remove-unused-flashinfer-aot-compile-arg

Conversation

@haitwang-cloud
Copy link
Copy Markdown
Contributor

@haitwang-cloud haitwang-cloud commented Jan 20, 2026

Purpose

Remove the unused FLASHINFER_AOT_COMPILE build argument from CUDA 12.9 release image builds in the Buildkite pipeline.

The FLASHINFER_AOT_COMPILE=true build argument was being passed to docker build commands but was never declared or used in docker/Dockerfile. FlashInfer pre-compiled packages (flashinfer-cubin and flashinfer-jit-cache) are installed unconditionally in the vllm-base stage, making this argument ineffective.

This commit removes the dead code from:

  • Build release image - x86_64 - CUDA 12.9
  • Build release image - aarch64 - CUDA 12.9

Note: CUDA 13.0 release image builds already did not include this argument.

Test Plan

  1. Verify that FLASHINFER_AOT_COMPILE is not declared as an ARG in docker/Dockerfile
  2. Confirm that FlashInfer packages are installed unconditionally in vllm-base stage
  3. Ensure no other files reference this build argument

Test Result


Essential Elements of an Effective PR Description Checklist
  • The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
  • The test plan, such as providing test command.
  • The test results, such as pasting the results comparison before and after, or e2e results
  • (Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
  • (Optional) Release notes update. If your change is user facing, please update the release notes draft in the Google Doc.

@mergify mergify Bot added the ci/build label Jan 20, 2026
Copy link
Copy Markdown
Contributor

@gemini-code-assist gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request removes the unused FLASHINFER_AOT_COMPILE build argument from the Buildkite pipeline for CUDA 12.9 release image builds. The change is correct, as my analysis of docker/Dockerfile confirms the argument is neither declared nor used. This cleanup improves the clarity and maintainability of the build configuration. The change is sound and I have no further comments.

The FLASHINFER_AOT_COMPILE build argument was being passed to docker build
commands but was never declared or used in the Dockerfile.

FlashInfer pre-compiled packages (flashinfer-cubin and flashinfer-jit-cache)
are installed unconditionally in the vllm-base stage, making this argument
ineffective.

This commit removes the unused argument from CUDA 12.9 release image builds
to reduce confusion in the build configuration.

Signed-off-by: Tim <tim.wang03@sap.com>
@haitwang-cloud haitwang-cloud force-pushed the cleanup/remove-unused-flashinfer-aot-compile-arg branch from 4fd7d14 to dc4699b Compare January 20, 2026 02:14
@haitwang-cloud haitwang-cloud changed the title build: remove unused FLASHINFER_AOT_COMPILE build argument [CI]: remove unused FLASHINFER_AOT_COMPILE build argument Jan 20, 2026
@mergify
Copy link
Copy Markdown
Contributor

mergify Bot commented Jan 22, 2026

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @haitwang-cloud.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

@mergify mergify Bot added the needs-rebase label Jan 22, 2026
Harry-Chen added a commit to Harry-Chen/vllm that referenced this pull request Apr 15, 2026
The FLASHINFER_AOT_COMPILE ARG was removed from the Dockerfile in
PR vllm-project#32627, but stale references remained in the release pipeline and
docker-bake.hcl. Clean them up.

Signed-off-by: Shengqi Chen <harry-chen@outlook.com>
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Signed-off-by: Shengqi Chen <harry-chen@outlook.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant