Conversation
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Remove variant/ROCm/alias handling -- just generate a flat PEP 503 index. Install uv with Python 3.12 for faster wheel builds. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
|
You can try it out with |
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
There was a problem hiding this comment.
💡 Codex Review
Here are some automated review suggestions for this pull request.
Reviewed commit: 9c66d3671e
ℹ️ About Codex in GitHub
Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you
- Open a pull request for review
- Mark a draft as ready
- Comment "@codex review".
If Codex has suggestions, it will comment; otherwise it will react with 👍.
Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".
| if suffix.startswith(("rocm", "cu", "cpu")): | ||
| variant = suffix |
There was a problem hiding this comment.
Parse +npu/+xpu release suffixes as variants
Update parse_from_filename to recognize all release device suffixes the project can emit. The current check only treats rocm, cu, and cpu suffixes as variants, so release wheels like ...+npu or ...+xpu (produced by setup.py) are misclassified as the default variant. If multiple device wheels are uploaded under one commit/version directory, the generated default index can point at NPU/XPU artifacts instead of isolating them under variant subdirectories, which leads to incorrect wheel resolution from the generic index.
Useful? React with 👍 / 👎.
|
Sample run: https://buildkite.com/vllm/omni-release/builds/9 |
Gaohan123
left a comment
There was a problem hiding this comment.
LGTM. Let's check if it can solve the pipeline failure https://buildkite.com/vllm/omni-release/builds/21/steps/canvas
|
Could you add a documentation explaining how users can use this feature? Thank you very much. |
Signed-off-by: khluu <khluu000@gmail.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Summary
Add a nightly wheel release pipeline for vllm-omni. Wheels are hosted on the existing
vllm-wheelsS3 bucket under theomni/prefix, served viawheels.vllm.ai.How it works
uvwith Python 3.12, runspython3 -m buildto produce a pure Python wheels3://vllm-wheels/omni/<commit>/s3://vllm-wheels/omni/<commit>/(always)s3://vllm-wheels/omni/nightly/(whenNIGHTLY=1env var is set)s3://vllm-wheels/omni/<version>/(for non-dev release versions only)The nightly index at
omni/nightly/uses relative paths pointing back to the latest commit's wheel directory.Installation
Files added
.buildkite/nightly-release-pipeline.yaml.buildkite/scripts/upload-nightly-wheels.shdist/to S3 underomni/<commit>/.buildkite/scripts/generate-and-upload-nightly-index.sh.buildkite/scripts/generate-nightly-index.pyS3 layout
🤖 Generated with Claude Code