Upgrade to CUDA 12.9 by regro-cf-autotick-bot · Pull Request #5 · conda-forge/vllm-feedstock

regro-cf-autotick-bot · 2025-07-29T22:56:44Z

This PR has been triggered in an effort to update cuda129.

Notes and instructions for merging this PR:

Please merge the PR only after the tests have passed.
Feel free to push to the bot's branch to update this PR if needed.

Please note that if you close this PR we presume that the feedstock has been rebuilt, so if you are going to perform the rebuild yourself, don't close this PR until your rebuild has been merged.

Here are some more details about this specific migrator:

CUDA 12.8 added support for architectures sm_100, sm_101 and sm_120,
while CUDA 12.9 further added sm_103 and sm_121. To build for these,
maintainers will need to modify their existing list of specified architectures
(e.g. CMAKE_CUDA_ARCHITECTURES, TORCH_CUDA_ARCH_LIST, etc.)
for their package. A good balance between broad support and storage
footprint (resp. compilation time) is to add sm_100 and sm_120.

Since CUDA 12.8, the conda-forge nvcc package now sets CUDAARCHS and
TORCH_CUDA_ARCH_LIST in its activation script to a string containing all
of the supported real architectures plus the virtual architecture of the
latest. Recipes for packages that use these variables to control their build
but do not want to build for all supported architectures will need to override
these variables in their build script.

ref: https://docs.nvidia.com/cuda/cuda-toolkit-release-notes/index.html#new-features
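
As a rough illustration of the override described above, the following Python sketch builds reduced values for CUDAARCHS and TORCH_CUDA_ARCH_LIST in the shape the migrator text describes (real architectures plus the virtual architecture of the latest). The function name arch_overrides and the example architecture list are hypothetical. The NN-real / NN-virtual suffixes follow CMake's CUDA_ARCHITECTURES syntax and the +PTX suffix follows the TORCH_CUDA_ARCH_LIST convention; the exact strings emitted by the nvcc activation script may differ.

```python
def arch_overrides(capabilities):
    """Return reduced CUDAARCHS / TORCH_CUDA_ARCH_LIST values.

    `capabilities` is a list of compute capabilities such as ["8.0", "12.0"].
    This is an illustrative helper, not part of any conda-forge tooling.
    """
    if not capabilities:
        raise ValueError("need at least one compute capability")
    # Sort numerically so that "10.0" comes after "9.0".
    caps = sorted(
        set(capabilities),
        key=lambda c: tuple(int(part) for part in c.split(".")),
    )
    # CMake spelling drops the dot: "8.0" -> "80".
    cmake = [c.replace(".", "") for c in caps]
    # Real (SASS) code for every arch, plus virtual (PTX) code for the latest.
    cudaarchs = ";".join(f"{a}-real" for a in cmake) + f";{cmake[-1]}-virtual"
    # In TORCH_CUDA_ARCH_LIST, "+PTX" on the last entry requests PTX for it.
    torch_list = ";".join(caps) + "+PTX"
    return {"CUDAARCHS": cudaarchs, "TORCH_CUDA_ARCH_LIST": torch_list}


if __name__ == "__main__":
    # Print shell `export` lines that a build script could use as overrides.
    for name, value in arch_overrides(["8.0", "9.0", "10.0", "12.0"]).items():
        print(f'export {name}="{value}"')
```

Whether a given package honours these variables depends on its build system, so the output should be treated as a starting point for the recipe's build script rather than a guaranteed fix.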

If this PR was opened in error or needs to be updated please add the bot-rerun label to this PR. The bot will close this PR and schedule another one. If you do not have permissions to add this label, you can use the phrase @conda-forge-admin, please rerun bot in a PR comment to have the conda-forge-admin add it for you.

This PR was created by the regro-cf-autotick-bot. The regro-cf-autotick-bot is a service to automatically track the dependency graph, migrate packages, and propose package version updates for conda-forge. Feel free to drop us a line if there are any issues! This PR was generated by https://github.com/regro/cf-scripts/actions/runs/16608834922 - please use this URL for debugging.

Closes #14

conda-forge-admin · 2025-07-29T22:58:17Z

Hi! This is the friendly automated conda-forge-linting service.

I just wanted to let you know that I linted all conda-recipes in your PR (recipe/recipe.yaml) and found it was in an excellent condition.

I do have some suggestions for making it better though...

For recipe/recipe.yaml:

ℹ️ The recipe is not parsable by parser conda-recipe-manager. The recipe can only be automatically migrated to the new v1 format if it is parseable by conda-recipe-manager.

This message was generated by GitHub Actions workflow run https://github.com/conda-forge/conda-forge-webservices/actions/runs/16609234881. Examine the logs at this URL for more detail.

shermansiu · 2025-07-29T23:12:05Z

@conda-forge-admin, please re-render

conda-forge-admin · 2025-07-29T23:14:30Z

Hi! This is the friendly automated conda-forge-webservice.

I tried to rerender for you, but it looks like there was nothing to do.

This message was generated by GitHub Actions workflow run https://github.com/conda-forge/conda-forge-webservices/actions/runs/16609450927. Examine the logs at this URL for more detail.

h-vetinari · 2025-07-30T03:16:42Z

Marking as draft so the bot doesn't auto close (it's almost always better to rebase directly rather than rerun the bot).

When doing an interactive rebase of a bot PR: fix up minor conflicts (realistically only the build number might conflict), ensure the build number is increased (the bump might be dropped as "has already happened on main"), and don't hesitate to drop the rerender commit and redo it, either manually or by letting the bot do it.

h-vetinari · 2025-07-30T03:20:59Z

As commented on #6, this PR should ideally be rebased (but if you're not comfortable with that, feel free to ask the bot to rerun).

For some reason the CI on the server did not run in #6, but does run on main. This is strange, and not in line with the config merged in conda-forge/.cirun#109 (which has pull_request: true). Before digging further, I'd like to see if CI in PRs works now that the config has been merged to main.

h-vetinari · 2025-07-30T03:29:08Z

Also, please remove automerge as explained in #2

shermansiu · 2025-07-30T04:36:01Z

Automerge removed!

h-vetinari · 2025-07-30T05:17:03Z

You're going to need to debug the cross compiled builds (since they never ran in #6). Something is not yet working with

vllm-feedstock/recipe/recipe.yaml, lines 65 to 69 in c33b58a:

    - if: is_cross_compiling
      then:
      - python
      - cross-python_${{ target_platform }}
      - pytorch ==${{ pytorch_version }}

or you need extra config to ensure the right python is found (see logs on main).

Also, please develop and debug with only one Python version (quoting from #6):

In general, please be aware that you're using a very rare resource that's needed by other feedstocks as well. In particular, while developing a PR for some change, you should add
build:
  skip: match(python, "!=312")
(or however this is spelled in v1; I still don't know it by heart). Once everything is green, you can remove the skip to build out all versions only once before merging.

> Automerge removed!

In the same vein, please be mindful of not creating superfluous runs on main, but rather combine changes into a single PR where appropriate.

Working on the opengpu server needs some special rules you won't necessarily encounter in the rest of conda-forge. Please don't blindly apply previously known patterns, but think about how you can use this resource judiciously and effectively.

shermansiu · 2025-07-30T06:46:29Z

OK, that's good to know, thanks!

conda-forge-admin · 2025-07-30T06:50:05Z

Hi! This is the friendly automated conda-forge-linting service.

I was trying to look for recipes to lint for you, but it appears we have a merge conflict. Please try to merge or rebase with the base branch to resolve this conflict.

Please ping the 'conda-forge/core' team (using the @ notation in a comment) if you believe this is a bug.

shermansiu · 2025-07-30T06:52:57Z

@conda-forge-admin, please rerender

conda-forge-admin · 2025-07-30T06:54:24Z

Hi! This is the friendly automated conda-forge-linting service.

I just wanted to let you know that I linted all conda-recipes in your PR (recipe/recipe.yaml) and found it was in an excellent condition.

I do have some suggestions for making it better though...

For recipe/recipe.yaml:

ℹ️ The recipe is not parsable by parser conda-recipe-manager. The recipe can only be automatically migrated to the new v1 format if it is parseable by conda-recipe-manager.

This message was generated by GitHub Actions workflow run https://github.com/conda-forge/conda-forge-webservices/actions/runs/16616126486. Examine the logs at this URL for more detail.

h-vetinari · 2025-07-30T08:21:00Z

Note that CI cannot start for the bot commit. Any commit you want to run on the opengpu server needs to be by an authorized user (and the bot is not, intentionally).

So you either need to push some other pending change / clean-up, or simply an empty commit, which you can create using git commit --allow-empty -m "trigger CI"

conda-forge-admin · 2025-08-01T11:12:14Z

Hi! This is the friendly automated conda-forge-linting service.

I just wanted to let you know that I linted all conda-recipes in your PR (recipe/recipe.yaml) and found it was in an excellent condition.

shermansiu · 2025-08-01T11:13:15Z

@conda-forge-admin, please rerender

shermansiu · 2025-08-01T11:43:50Z

Depends on conda-forge/pytorch-cpu-feedstock#405

shermansiu · 2025-08-31T21:37:40Z

May actually depend on conda-forge/pytorch-cpu-feedstock#408 now

h-vetinari · 2025-08-31T22:05:15Z

> May actually depend on conda-forge/pytorch-cpu-feedstock#408 now

It shouldn't require a 12.9 build of pytorch. Or what aspect of that PR do you think is necessary?

shermansiu · 2025-08-31T22:44:48Z

It shouldn't rely on it, but for some reason, the build seems to fail for the CUDA 12.6 build of PyTorch.

shermansiu · 2025-09-02T00:28:45Z

@conda-forge-admin, please rerender

h-vetinari · 2025-09-03T10:13:50Z

OK, so vllm has some quite horrible homespun logic for CUDA arch selection, which produces something that's IMO highly questionable:

CUDA target architectures: 5.0;5.2;6.0;6.1;7.0;7.5;8.0;8.6;8.9;9.0;10.0;10.1;10.3;12.0;12.1

We've had problems with sm_90 in some places; let's try with a reduced set for now and see if that builds. Apparently the only way to pass this in is through a very verbose "interface" like

CUDA_ARCH_FLAGS="-gencode arch=compute_70,code=sm_70;-gencode arch=compute_75,code=sm_75"
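
To avoid writing that string by hand, a small helper can expand a semicolon-separated architecture list into the format quoted above. This is only a sketch: the function name gencode_flags is hypothetical, the output format is copied from the example in this comment, and it is an assumption that vllm's build accepts any list produced this way.

```python
def gencode_flags(cuda_archs):
    """Expand "7.0;7.5" into a semicolon-separated list of -gencode flags.

    Illustrative helper only; it mirrors the CUDA_ARCH_FLAGS example from
    the comment above and is not part of vllm or conda-forge tooling.
    """
    flags = []
    for arch in cuda_archs.split(";"):
        arch = arch.strip()
        if not arch:
            continue  # tolerate trailing or doubled semicolons
        num = arch.replace(".", "")  # "7.5" -> "75"
        flags.append(f"-gencode arch=compute_{num},code=sm_{num}")
    return ";".join(flags)


if __name__ == "__main__":
    # Reproduces the two-architecture example from the comment.
    print(f'CUDA_ARCH_FLAGS="{gencode_flags("7.0;7.5")}"')
```

Generating the value this way makes it easier to try a reduced set first and widen it later by editing a single list.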

shermansiu · 2025-09-16T08:41:08Z

Thanks for adding the Python 3.13 skip back in!

shermansiu · 2025-09-16T20:05:31Z

If the Python 3.10 build works, I'll probably just skip the CI for the final commit for re-enabling everything and just merge it in to prevent the same jobs from being run in the PR and in the main branch. I'll just double-check that the required jobs are created.

regro-cf-autotick-bot requested review from maresb and shermansiu as code owners July 29, 2025 22:56

h-vetinari mentioned this pull request Jul 30, 2025

Add cross-compilation for arm64 and aarch64 #6

Merged

5 tasks

h-vetinari marked this pull request as draft July 30, 2025 03:15

shermansiu force-pushed the rebuild-cuda129-0-1_he0d167 branch from 2c59cb7 to bd9a435 Compare July 30, 2025 06:52

h-vetinari mentioned this pull request Jul 30, 2025

Update version to 0.9.2 #10

Merged

5 tasks

shermansiu force-pushed the rebuild-cuda129-0-1_he0d167 branch from 2a4551c to 2667b5d Compare August 1, 2025 11:10

shermansiu force-pushed the rebuild-cuda129-0-1_he0d167 branch from 1ae5ef7 to bb35793 Compare September 2, 2025 00:28

regro-cf-autotick-bot and others added 2 commits September 2, 2025 00:04

[ci skip] Focus on a single build for now

244cc1a

shermansiu and others added 4 commits September 2, 2025 16:13

[ci skip] Just build CUDA for now.

4fadb80

MNT: Re-rendered with conda-smithy 3.52.1 and conda-forge-pinning 202…

553e6cb

…5.09.01.21.18.50

try with CUDA 12.8

dab443b

MNT: Re-rendered with conda-smithy 3.52.1 and conda-forge-pinning 202…

5021c6f

…5.09.02.23.06.43 Other tools: - conda-build 25.7.0 - rattler-build 0.46.0 - rattler-build-conda-compat 1.4.5

set CUDA_ARCH_FLAGS

2db3dbf

h-vetinari force-pushed the rebuild-cuda129-0-1_he0d167 branch from 5054593 to 2db3dbf Compare September 3, 2025 10:15

shermansiu and others added 16 commits September 14, 2025 06:37

[ci skip] Fix indentation

2fc69d1

Override CUDA archs

67c1702

Focus on CUDA 7.0 architecture for now

3e461e0

[ci skip] Use real list of CUDA archs

9a0268e

Use CUDA 12.9 for build

ed3c56e

[ci skip] Revert changes to CUDA 12.9 migration

72161f5

[ci skip] Try building a smaller set of CUDA architectures for now

3d564c4

Trigger build.

95582a3

Try to just build for CUDA 8.0 arch

3e606e5

[ci skip] Compress CUDA wheel

467a4b0

[ci skip] Use a single thread for CUDA compilation to be less resourc…

463e0e0

…e-intensive

Compile for a few more 8.* archs

092b7a0

Try with all archs but 4 max jobs

288576e

Build all wheels. MNT: Re-rendered with conda-smithy 3.52.2 and conda…

bd2191a

…-forge-pinning 2025.09.15.11.24.57

reinstate skip for py3.13

3c046a7

MNT: Re-rendered with conda-smithy 3.52.2 and conda-forge-pinning 202…

9e51458

…5.09.16.07.18.04 Other tools: - conda-build 25.7.0 - rattler-build 0.47.0 - rattler-build-conda-compat 1.4.6

Reduce max jobs to 3 and try to build the Python 3.10 wheel with CUDA…

97e63f6

…. MNT: Re-rendered with conda-smithy 3.52.2 and conda-forge-pinning 2025.09.16.07.18.04

shermansiu force-pushed the rebuild-cuda129-0-1_he0d167 branch from 471e0fb to 97e63f6 Compare September 16, 2025 18:54

[ci skip] Rebuild all wheels. MNT: Re-rendered with conda-smithy 3.52…

84b36d6

….2 and conda-forge-pinning 2025.09.16.07.18.04

shermansiu merged commit 827b7b7 into conda-forge:main Sep 17, 2025
2 checks passed

regro-cf-autotick-bot deleted the rebuild-cuda129-0-1_he0d167 branch September 17, 2025 06:08

4 participants