
Add vllm#28931

Merged
h-vetinari merged 165 commits into conda-forge:main from maresb:vllm
Jul 29, 2025

Conversation

@maresb
Contributor

@maresb maresb commented Jan 25, 2025

Very rough draft. I will almost certainly require help.

Opened on the advice of @h-vetinari in conda-forge/xformers-feedstock#42

Direct and transitive dependencies:

Closes #24710
Fixes #29105

Checklist

  • Title of this PR is meaningful: e.g. "Adding my_nifty_package", not "updated meta.yaml".
  • License file is packaged (see here for an example).
  • Source is from official source.
  • Package does not vendor other packages. (If a package uses the source of another package, they should be separate packages or the licenses of all packages need to be packaged).
  • If static libraries are linked in, the license of the static library is packaged.
  • Package does not ship static libraries. If static libraries are needed, follow CFEP-18.
  • Build number is 0.
  • A tarball (url) rather than a repo (e.g. git_url) is used in your recipe (see here for more details).
  • GitHub users listed in the maintainer section have posted a comment confirming they are willing to be listed there.
  • When in trouble, please check our knowledge base documentation before pinging a team.

@github-actions
Contributor

Hi! This is the staged-recipes linter and your PR looks excellent! 🚀

@conda-forge-admin
Contributor

Hi! This is the friendly automated conda-forge-linting service.

I wanted to let you know that I linted all conda-recipes in your PR (recipes/vllm/recipe.yaml) and found some lint.

Here's what I've got...

For recipes/vllm/recipe.yaml:

  • ❌ license_file entry is missing, but is required.
  • ❌ Non noarch packages should have python requirement without any version constraints.
  • ❌ Non noarch packages should have python requirement without any version constraints.

For recipes/vllm/recipe.yaml:

  • ℹ️ Please depend on pytorch directly. If your package definitely requires the CUDA version, please depend on pytorch =*=cuda*.
  • ℹ️ Use importlib-metadata instead of importlib_metadata
  • ℹ️ PyPI default URL is now pypi.org, and not pypi.io. You may want to update the default source url.

This message was generated by GitHub Actions workflow run https://github.com/conda-forge/conda-forge-webservices/actions/runs/12962027449. Examine the logs at this URL for more detail.

@conda-forge-admin
Contributor

Hi! This is the friendly automated conda-forge-linting service.

I just wanted to let you know that I linted all conda-recipes in your PR (recipes/vllm/recipe.yaml) and found it was in an excellent condition.

@maresb
Contributor Author

maresb commented Jan 25, 2025

Interesting, we're getting different results between CUDA 11.8 and 12.0.

Both fail in the following command:

['cmake', '$SRC_DIR', '-G', 'Ninja', '-DCMAKE_BUILD_TYPE=RelWithDebInfo', 
'-DVLLM_TARGET_DEVICE=cuda', '-DVLLM_PYTHON_EXECUTABLE=$PREFIX/bin/python', 
'-DVLLM_PYTHON_PATH=$PREFIX/lib/python3.9/site-packages/pip/_vendor/pyproject_hooks/_in_process:$PREFIX/lib/python39.zip:$PREFIX/lib/python3.9:$PREFIX/lib/python3.9/lib-dynload:$PREFIX/lib/python3.9/site-packages:$PREFIX/lib/python3.9/site-packages/setuptools/_vendor', 
'-DFETCHCONTENT_BASE_DIR=$SRC_DIR/.deps', '-DNVCC_THREADS=1',
'-DCMAKE_JOB_POOL_COMPILE:STRING=compile', '-DCMAKE_JOB_POOLS:STRING=compile=2']

12.0 fails earlier at CUDA detection:

 │ │       -- Caffe2: Found protobuf with new-style protobuf targets.
 │ │       -- Caffe2: Protobuf version 28.2.0
 │ │       -- Could NOT find CUDA (missing: CUDA_INCLUDE_DIRS) (found version "12.0")
 │ │       CMake Warning at $PREFIX/lib/python3.9/site-packages/torch/share/cmake/Caffe2/public/cuda.cmake:31 (message):
 │ │         Caffe2: CUDA cannot be found.  Depending on whether you are building Caffe2
 │ │         or a Caffe2 dependent library, the next warning / error will give you more
 │ │         info.
 │ │       Call Stack (most recent call first):
 │ │         $PREFIX/lib/python3.9/site-packages/torch/share/cmake/Caffe2/Caffe2Config.cmake:86 (include)
 │ │         $PREFIX/lib/python3.9/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:68 (find_package)
 │ │         CMakeLists.txt:84 (find_package)
 │ │       
 │ │       
 │ │       CMake Error at $PREFIX/lib/python3.9/site-packages/torch/share/cmake/Caffe2/Caffe2Config.cmake:90 (message):
 │ │         Your installed Caffe2 version uses CUDA but I cannot find the CUDA
 │ │         libraries.  Please set the proper CUDA prefixes and / or install CUDA.
 │ │       Call Stack (most recent call first):
 │ │         $PREFIX/lib/python3.9/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:68 (find_package)
 │ │         CMakeLists.txt:84 (find_package)

11.8 gets further:

 │ │       -- Caffe2: Found protobuf with new-style protobuf targets.
 │ │       -- Caffe2: Protobuf version 28.2.0
 │ │       -- Found CUDA: /usr/local/cuda (found version "11.8")
 │ │       -- The CUDA compiler identification is NVIDIA 11.8.89 with host compiler GNU 11.4.0
 │ │       -- Detecting CUDA compiler ABI info
 │ │       -- Detecting CUDA compiler ABI info - done
 │ │       -- Check for working CUDA compiler: $PREFIX/bin/nvcc - skipped
 │ │       -- Detecting CUDA compile features
 │ │       -- Detecting CUDA compile features - done
 │ │       -- Found CUDAToolkit: /usr/local/cuda/include (found version "11.8.89")
 │ │       -- Caffe2: CUDA detected: 11.8
 │ │       -- Caffe2: CUDA nvcc is: /usr/local/cuda/bin/nvcc
 │ │       -- Caffe2: CUDA toolkit directory: /usr/local/cuda
 │ │       -- Caffe2: Header version is: 11.8
 │ │       -- Found Python: $PREFIX/bin/python (found version "3.9.21") found components: Interpreter
 │ │       CMake Warning at $PREFIX/lib/python3.9/site-packages/torch/share/cmake/Caffe2/public/cuda.cmake:140 (message):
 │ │         Failed to compute shorthash for libnvrtc.so
 │ │       Call Stack (most recent call first):
 │ │         $PREFIX/lib/python3.9/site-packages/torch/share/cmake/Caffe2/Caffe2Config.cmake:86 (include)
 │ │         $PREFIX/lib/python3.9/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:68 (find_package)
 │ │         CMakeLists.txt:84 (find_package)
 │ │       
 │ │       
 │ │       CMake Warning (dev) at $PREFIX/share/cmake-3.31/Modules/FindPackageHandleStandardArgs.cmake:441 (message):
 │ │         The package name passed to `find_package_handle_standard_args` (nvtx3) does
 │ │         not match the name of the calling package (Caffe2).  This can lead to
 │ │         problems in calling code that expects `find_package` result variables
 │ │         (e.g., `_FOUND`) to follow a certain pattern.
 │ │       Call Stack (most recent call first):
 │ │         $PREFIX/lib/python3.9/site-packages/torch/share/cmake/Caffe2/public/cuda.cmake:174 (find_package_handle_standard_args)
 │ │         $PREFIX/lib/python3.9/site-packages/torch/share/cmake/Caffe2/Caffe2Config.cmake:86 (include)
 │ │         $PREFIX/lib/python3.9/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:68 (find_package)
 │ │         CMakeLists.txt:84 (find_package)
 │ │       This warning is for project developers.  Use -Wno-dev to suppress it.
 │ │       
 │ │       -- Could NOT find nvtx3 (missing: nvtx3_dir)
 │ │       CMake Warning at $PREFIX/lib/python3.9/site-packages/torch/share/cmake/Caffe2/public/cuda.cmake:180 (message):
 │ │         Cannot find NVTX3, find old NVTX instead
 │ │       Call Stack (most recent call first):
 │ │         $PREFIX/lib/python3.9/site-packages/torch/share/cmake/Caffe2/Caffe2Config.cmake:86 (include)
 │ │         $PREFIX/lib/python3.9/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:68 (find_package)
 │ │         CMakeLists.txt:84 (find_package)
 │ │       
 │ │       
 │ │       -- USE_CUDNN is set to 0. Compiling without cuDNN support
 │ │       -- USE_CUSPARSELT is set to 0. Compiling without cuSPARSELt support
 │ │       -- USE_CUDSS is set to 0. Compiling without cuDSS support
 │ │       -- USE_CUFILE is set to 0. Compiling without cuFile support
 │ │       -- Automatic GPU detection failed. Building for common architectures.
 │ │       -- Autodetected CUDA architecture(s): 3.5;5.0;8.0;8.6;8.9;9.0;8.9+PTX;9.0+PTX
 │ │       -- Added CUDA NVCC flags for: -gencode;arch=compute_35,code=sm_35;-gencode;arch=compute_50,code=sm_50;-gencode;arch=compute_80,code=sm_80;-gencode;arch=compute_86,code=sm_86;-gencode;arch=compute_89,code=sm_89;-gencode;arch=compute_90,code=sm_90;-gencode;arch=compute_89,code=compute_89;-gencode;arch=compute_90,code=compute_90
 │ │       -- Found Torch: $PREFIX/lib/libtorch.so
 │ │       -- CUDA target architectures: 3.5;5.0;8.0;8.6;8.9;9.0
 │ │       -- CUDA supported target architectures: 8.0;8.6;8.9;9.0
 │ │       -- FetchContent base directory: $SRC_DIR/.deps
 │ │       CMake Error at $PREFIX/share/cmake-3.31/Modules/ExternalProject/shared_internal_commands.cmake:943 (message):
 │ │         error: could not find git for clone of cutlass-populate
 │ │       Call Stack (most recent call first):
 │ │         $PREFIX/share/cmake-3.31/Modules/ExternalProject.cmake:3041 (_ep_add_download_command)
 │ │         CMakeLists.txt:29 (ExternalProject_Add)
 │ │       Call Stack (most recent call first):
 │ │         $PREFIX/lib/python3.9/site-packages/torch/share/cmake/Caffe2/Caffe2Config.cmake:86 (include)
 │ │         $PREFIX/lib/python3.9/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:68 (find_package)
 │ │         CMakeLists.txt:84 (find_package)
 │ │       
 │ │       
 │ │       -- USE_CUDNN is set to 0. Compiling without cuDNN support
 │ │       -- USE_CUSPARSELT is set to 0. Compiling without cuSPARSELt support
 │ │       -- USE_CUDSS is set to 0. Compiling without cuDSS support
 │ │       -- USE_CUFILE is set to 0. Compiling without cuFile support
 │ │       -- Automatic GPU detection failed. Building for common architectures.
 │ │       -- Autodetected CUDA architecture(s): 3.5;5.0;8.0;8.6;8.9;9.0;8.9+PTX;9.0+PTX
 │ │       -- Added CUDA NVCC flags for: -gencode;arch=compute_35,code=sm_35;-gencode;arch=compute_50,code=sm_50;-gencode;arch=compute_80,code=sm_80;-gencode;arch=compute_86,code=sm_86;-gencode;arch=compute_89,code=sm_89;-gencode;arch=compute_90,code=sm_90;-gencode;arch=compute_89,code=compute_89;-gencode;arch=compute_90,code=compute_90
 │ │       -- Found Torch: $PREFIX/lib/libtorch.so
 │ │       -- CUDA target architectures: 3.5;5.0;8.0;8.6;8.9;9.0
 │ │       -- CUDA supported target architectures: 8.0;8.6;8.9;9.0
 │ │       -- FetchContent base directory: $SRC_DIR/.deps
 │ │       CMake Error at $PREFIX/share/cmake-3.31/Modules/ExternalProject/shared_internal_commands.cmake:943 (message):
 │ │         error: could not find git for clone of cutlass-populate
 │ │       Call Stack (most recent call first):
 │ │         $PREFIX/share/cmake-3.31/Modules/ExternalProject.cmake:3041 (_ep_add_download_command)
 │ │         CMakeLists.txt:29 (ExternalProject_Add)
 │ │       
 │ │       
 │ │       -- Configuring incomplete, errors occurred!

@conda-forge-admin
Contributor

Hi! This is the friendly automated conda-forge-linting service.

I wanted to let you know that I linted all conda-recipes in your PR (recipes/vllm/recipe.yaml) and found some lint.

Here's what I've got...

For recipes/vllm/recipe.yaml:

  • ❌ Selectors in comment form no longer work in v1 recipes. Instead, if / then / else maps must be used. See lines [39, 41, 46, 48, 49].

This message was generated by GitHub Actions workflow run https://github.com/conda-forge/conda-forge-webservices/actions/runs/12967644561. Examine the logs at this URL for more detail.
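For anyone hitting the same lint: a v0-style comment selector such as `- libgomp  # [linux]` has to be rewritten as an if / then / else map in v1 recipes. A minimal sketch of the conversion (the package names here are illustrative, not taken from this recipe):

```yaml
requirements:
  host:
    # v0 form (no longer supported in recipe.yaml):
    #   - libgomp      # [linux]
    #   - llvm-openmp  # [osx]
    # v1 form:
    - if: linux
      then:
        - libgomp
      else:
        - llvm-openmp
```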

@conda-forge-admin
Contributor

Hi! This is the friendly automated conda-forge-linting service.

I just wanted to let you know that I linted all conda-recipes in your PR (recipes/vllm/recipe.yaml) and found it was in an excellent condition.

@h-vetinari
Member

Thanks @maresb! I had forgotten that there's already #24710, perhaps @mediocretech would be interested in collaborating?

W.r.t. CUDA, we need to move on from 12.0 here, which isn't used anywhere else in conda-forge anymore - it's just that staged-recipes seems to have been forgotten in the context of conda-forge/conda-forge-pinning-feedstock#6630.

@maresb
Contributor Author

maresb commented Jan 26, 2025

Oh, I didn't notice that effort, thanks @h-vetinari! Although that's old it looks like @rongou is eager to help! 🚀

Do you think that CUDA 12.0 is actually causing a problem here? I was thinking (i.e. wildly guessing) that we need to patch CMakeLists.txt, but I've never used cmake. 😞

@h-vetinari
Member

Mainly I want to avoid redundant work. As soon as #28938 is in and we have merged main here, I'll be happy to take a look what's going on.

@h-vetinari
Member

In any case, you'll have to address

 │ │       CMake Error at $PREFIX/share/cmake-3.31/Modules/ExternalProject/shared_internal_commands.cmake:943 (message):
 │ │         error: could not find git for clone of cutlass-populate

@maresb
Contributor Author

maresb commented Jan 27, 2025

Woah, after adding git as a host dependency it's compiling on CUDA 11.8 until it runs out of memory and crashes. Maybe I can add some swap. CUDA 12.0 is still not being discovered.

...
 │ │ Building wheels for collected packages: vllm
 │ │   Building wheel for vllm (pyproject.toml): started
 │ │   Building wheel for vllm (pyproject.toml): still running...
...
 │ │   Building wheel for vllm (pyproject.toml): still running...
##[warning]Free memory is lower than 5%; Currently used: 95.80%
##[warning]Free memory is lower than 5%; Currently used: 95.80%
##[warning]Free memory is lower than 5%; Currently used: 95.80%
##[warning]Free memory is lower than 5%; Currently used: 95.80%
 │ │   Building wheel for vllm (pyproject.toml): still running...
 │ │   Building wheel for vllm (pyproject.toml): still running...
 │ │   Building wheel for vllm (pyproject.toml): still running...
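For reference, the git fix amounts to a one-line addition to the recipe (shown as a sketch; the surrounding keys are assumed, not copied from the actual recipe.yaml):

```yaml
requirements:
  host:
    # ExternalProject_Add in vllm's CMakeLists clones CUTLASS at configure
    # time, so git must be available in the build-time environment.
    - git
```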

@github-actions
Contributor

github-actions Bot commented Jan 27, 2025

Hi! This is the staged-recipes linter and I found some lint.

It looks like some changes were made outside the recipes/ directory. To ensure everything runs smoothly, please make sure that recipes are only added to the recipes/ directory and no other files are changed.

If these changes are intentional (and you aren't submitting a recipe), please add a maintenance label to the PR.

File-specific lints and/or hints:

  • .scripts/debug_osx_arch.sh:

    • lints:
      • Do not edit files outside of the recipes/ directory.
  • conda-forge.yml:

    • lints:
      • Do not edit files outside of the recipes/ directory.
  • .scripts/new_run_osx_build.sh:

    • lints:
      • Do not edit files outside of the recipes/ directory.
  • .scripts/new_run_docker_build.sh:

    • lints:
      • Do not edit files outside of the recipes/ directory.

@maresb
Contributor Author

maresb commented Jan 27, 2025

Ah, hmm, I just added swap to conda-forge.yml. Not sure how that's supposed to work here on staged-recipes. 🤔

EDIT: Oh good, the linter is complaining, so that will help us to remember to revert it before merging.

EDIT2: Hmm, it seems that the swap setting works on linux_64 but fails on linux_64_cuda_*:

[screenshot: swap is applied on linux_64 but fails on the linux_64_cuda_* builds]

@maresb
Contributor Author

maresb commented Jan 29, 2025

Hi @h-vetinari!

As soon as #28938 is in and we have merged main here, I'll be happy to take a look what's going on.

As a brief summary of the above, I merged main into this branch after #28938 was merged into main. It didn't seem to change anything with respect to the errors.

On CUDA 12.x I'm hitting the error:

Your installed Caffe2 version uses CUDA but I cannot find the CUDA
 │ │         libraries.  Please set the proper CUDA prefixes and / or install CUDA

On 11.8, after adding git as a host dependency, compilation starts but it runs out of memory. I tried to add swap by editing conda-forge.yml, but it didn't apply to the CUDA builds.

I'd be grateful for any advice you could provide. Thanks!

@h-vetinari
Member

On CUDA 12.x I'm hitting the error:

We're (now) aware of the CUDA-angle of conda-forge/pytorch-cpu-feedstock#333

On 11.8, after adding git as a host dependency, compilation starts but it runs out of memory. I tried to add swap by editing conda-forge.yml, but it didn't apply to the CUDA builds.

#28979

@maresb
Contributor Author

maresb commented Feb 1, 2025

I would have hoped to get more out of setting VERBOSE=1. The only logs I get are:

 │ │   Building wheel for vllm (pyproject.toml): still running...

VERBOSE=1 is supposed to add the flag -DCMAKE_VERBOSE_MAKEFILE=ON. Not sure what exactly that does.

Here's the corresponding Python code to go from the envvar to get the flag:

https://github.com/vllm-project/vllm/blob/a1fc18c030e4d0466f2b23cb7dd4d11ce4b85603/vllm/envs.py#L138-L140

https://github.com/vllm-project/vllm/blob/a1fc18c030e4d0466f2b23cb7dd4d11ce4b85603/setup.py#L132-L134
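The pattern in those two links can be sketched as follows (a simplified stand-in, not vllm's exact code): a truthy integer `VERBOSE` in the environment turns into `-DCMAKE_VERBOSE_MAKEFILE=ON`, which for Makefile generators makes the generated build echo the full compiler command lines.

```python
def cmake_verbose_flags(environ):
    """Sketch of vllm's env-var hand-off: VERBOSE is parsed as an integer
    and, when truthy, appends the verbose-makefile flag to the cmake args."""
    verbose = bool(int(environ.get("VERBOSE", "0")))
    flags = []
    if verbose:
        flags.append("-DCMAKE_VERBOSE_MAKEFILE=ON")
    return flags

print(cmake_verbose_flags({"VERBOSE": "1"}))  # ['-DCMAKE_VERBOSE_MAKEFILE=ON']
print(cmake_verbose_flags({}))                # []
```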

@maresb maresb closed this Feb 4, 2025
@maresb maresb reopened this Feb 4, 2025
@shermansiu shermansiu mentioned this pull request Feb 12, 2025
@shermansiu
Contributor

Hmm, it still appears broken after conda-forge/pytorch-cpu-feedstock#339.

Could NOT find CUDA (missing: CUDA_INCLUDE_DIRS) (found version "12.6")

Is CUDA_INCLUDE_DIRS properly set?

@shermansiu
Contributor

shermansiu commented Feb 12, 2025

The CUDA 11.8 build probably fails because it's out of disk space and/or RAM, but that's just speculation:

##[warning]Free disk space on / is lower than 5%; Currently used: 95.08% (x5)

##[warning]Free memory is lower than 5%; Currently used: 96.11% (x5)

@maresb
Contributor Author

maresb commented Feb 12, 2025

Hey @shermansiu, great to have you around!!!

I'm a bit lost since I'm not very familiar with CUDA.

I was just now having some trouble getting the CI to rerun the CUDA builds, but rebasing seems to have fixed it.

Also, post-rebase things seem to be proceeding slightly further for 12.6:

 │ │       -- Caffe2: Found protobuf with new-style protobuf targets.
 │ │       -- Caffe2: Protobuf version 28.3.0
 │ │       -- Unable to find cublas_v2.h in either "$PREFIX/targets/x86_64-linux/include" or "$PREFIX/math_libs/include"
 │ │       -- Found CUDAToolkit: $PREFIX/targets/x86_64-linux/include (found version "12.6.85")
 │ │       -- Check for working CUDA compiler: $PREFIX/bin/nvcc - skipped
 │ │       -- Detecting CUDA compile features
 │ │       -- Detecting CUDA compile features - done
 │ │       -- Unable to find cublas_v2.h in either "$PREFIX/targets/x86_64-linux/include" or "$PREFIX/math_libs/include"
 │ │       -- Caffe2: CUDA detected: 12.6.85
 │ │       -- Caffe2: CUDA nvcc is: $PREFIX/bin/nvcc
 │ │       -- Caffe2: CUDA toolkit directory:
 │ │       -- Caffe2: Header version is: 12.6
 │ │       CMake Error at $PREFIX/lib/python3.9/site-packages/torch/share/cmake/Caffe2/public/cuda.cmake:107 (get_target_property):
 │ │         get_target_property() called with non-existent target "CUDA::nvrtc".
 │ │       Call Stack (most recent call first):
 │ │         $PREFIX/lib/python3.9/site-packages/torch/share/cmake/Caffe2/Caffe2Config.cmake:86 (include)
 │ │         $PREFIX/lib/python3.9/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:68 (find_package)
 │ │         CMakeLists.txt:81 (find_package)
 │ │       
 │ │       
 │ │       -- Found Python: $PREFIX/bin/python (found version "3.9.21") found components: Interpreter
 │ │       CMake Warning at $PREFIX/lib/python3.9/site-packages/torch/share/cmake/Caffe2/public/cuda.cmake:116 (message):
 │ │         Failed to compute shorthash for libnvrtc.so
 │ │       Call Stack (most recent call first):
 │ │         $PREFIX/lib/python3.9/site-packages/torch/share/cmake/Caffe2/Caffe2Config.cmake:86 (include)
 │ │         $PREFIX/lib/python3.9/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:68 (find_package)
 │ │         CMakeLists.txt:81 (find_package)
 │ │       
 │ │       
 │ │       CMake Warning (dev) at $PREFIX/share/cmake-3.31/Modules/FindPackageHandleStandardArgs.cmake:441 (message):
 │ │         The package name passed to `find_package_handle_standard_args` (nvtx3) does
 │ │         not match the name of the calling package (Caffe2).  This can lead to
 │ │         problems in calling code that expects `find_package` result variables
 │ │         (e.g., `_FOUND`) to follow a certain pattern.
 │ │       Call Stack (most recent call first):
 │ │         $PREFIX/lib/python3.9/site-packages/torch/share/cmake/Caffe2/public/cuda.cmake:154 (find_package_handle_standard_args)
 │ │         $PREFIX/lib/python3.9/site-packages/torch/share/cmake/Caffe2/Caffe2Config.cmake:86 (include)
 │ │         $PREFIX/lib/python3.9/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:68 (find_package)
 │ │         CMakeLists.txt:81 (find_package)
 │ │       This warning is for project developers.  Use -Wno-dev to suppress it.
 │ │       
 │ │       -- Could NOT find nvtx3 (missing: nvtx3_dir)
 │ │       CMake Warning at $PREFIX/lib/python3.9/site-packages/torch/share/cmake/Caffe2/public/cuda.cmake:160 (message):
 │ │         Cannot find NVTX3, find old NVTX instead
 │ │ Failed to build vllm
 │ │       Call Stack (most recent call first):
 │ │         $PREFIX/lib/python3.9/site-packages/torch/share/cmake/Caffe2/Caffe2Config.cmake:86 (include)
 │ │         $PREFIX/lib/python3.9/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:68 (find_package)
 │ │         CMakeLists.txt:81 (find_package)
 │ │       
 │ │       
 │ │       -- USE_CUDNN is set to 0. Compiling without cuDNN support
 │ │       -- USE_CUSPARSELT is set to 0. Compiling without cuSPARSELt support
 │ │       -- USE_CUDSS is set to 0. Compiling without cuDSS support
 │ │       -- USE_CUFILE is set to 0. Compiling without cuFile support
 │ │       -- Added CUDA NVCC flags for:
 │ │       CMake Warning at $PREFIX/lib/python3.9/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:22 (message):
 │ │         static library kineto_LIBRARY-NOTFOUND not found.
 │ │       Call Stack (most recent call first):
 │ │         $PREFIX/lib/python3.9/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:120 (append_torchlib_if_found)
 │ │         CMakeLists.txt:81 (find_package)
 │ │       
 │ │       
 │ │       -- Found Torch: $PREFIX/lib/libtorch.so
 │ │       CMake Error at CMakeLists.txt:122 (message):
 │ │         Can't find CUDA or HIP installation.

I'm not too sure what this means or how to fix it. I'd be very grateful for any suggestions.

@shermansiu
Contributor

Hmm, I'd like to build the recipe locally to diagnose this further, but at a glance, the following line looks a bit concerning:

 -- Unable to find cublas_v2.h in either "$PREFIX/targets/x86_64-linux/include" or "$PREFIX/math_libs/include"

Member

@h-vetinari h-vetinari left a comment


Well, you need more than just {{ compiler("cuda") }} to get all the CUDA components you need.

Looks like you need at minimum

    - cuda-version =={{ cuda_compiler_version }}
    - cuda-cudart-dev
    - cuda-nvrtc-dev
    - libcublas-dev

in the host environment. Also note that we're still figuring out an issue with nvtx, see conda-forge/pytorch-cpu-feedstock#357
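Translated into the v1 syntax used in recipe.yaml, the suggestion above would look roughly like this (a sketch; the selector structure in the final recipe may differ):

```yaml
requirements:
  host:
    - if: cuda
      then:
        - cuda-version ==${{ cuda_compiler_version }}
        - cuda-cudart-dev
        - cuda-nvrtc-dev
        - libcublas-dev
```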

Comment thread recipes/vllm/recipe.yaml Outdated
Comment on lines +33 to +38
- cmake
- git
- ${{ stdlib('c') }}
- ${{ compiler('c') }}
- ${{ compiler('cxx') }}
- ${{ compiler('cuda') }}
Member


All this (+ninja) should move to the build environment.

Contributor

@shermansiu shermansiu left a comment


This seems to resolve the nvtx issue, but then it complains about not being able to find kineto.

Using USE_KINETO=0 doesn't seem to work because the existing PyTorch .cmake files in the environment already have kineto enabled.

lib/python3.9/site-packages/torch/share/cmake/Torch/TorchConfig.cmake

if(ON)
  append_torchlib_if_found(kineto)
endif()

See:

Comment thread recipes/vllm/recipe.yaml
Comment thread recipes/vllm/recipe.yaml
@shermansiu
Contributor

shermansiu commented Jul 21, 2025

Thanks so much, @h-vetinari, for your incredibly detailed review!

Member

@h-vetinari h-vetinari left a comment


getting there!

Comment thread recipes/vllm/recipe.yaml Outdated
Comment thread recipes/vllm/recipe.yaml Outdated
Member

@h-vetinari h-vetinari left a comment


This PR basically LGTM now! Thanks for all the hard work! It's still marked as a draft — are you missing anything else? Some tasks related to the GPU server are still ahead, and there are some suggestions for further improvements below.

Comment thread recipes/vllm/recipe.yaml Outdated
Comment thread recipes/vllm/conda_build_config.yaml
Comment thread recipes/vllm/conda_build_config.yaml Outdated
Comment thread recipes/vllm/recipe.yaml Outdated
Comment thread recipes/vllm/recipe.yaml
Comment thread recipes/vllm/patches/0003-Manually-define-gettid.patch
@shermansiu
Contributor

shermansiu commented Jul 24, 2025

Thanks for looking at this, @h-vetinari! I should be able to get to the rest of the things later in the week, hopefully!

@shermansiu
Contributor

If there are no other requested changes, I'm good to have this merged! The pull request is no longer just a draft, but I don't have the permissions to change it.

@shermansiu
Contributor

@maresb Please add "Closes #24710" and "fixes #29105" to the initial PR description so that we can close the other vllm PR and the package request issue automatically once this gets merged

@shermansiu
Contributor

Anyways, the CUDA 12.6 build works locally and the tests pass:

 │ Installing test environment
 │ ✔ Successfully updated the test environment
 │ Testing commands:
 │ ============================= test session starts ==============================
 │ platform linux -- Python 3.10.18, pytest-8.4.1, pluggy-1.6.0
 │ rootdir: $PREFIX/etc/conda/test-files/vllm/2
 │ plugins: anyio-4.9.0
 │ collected 22 items
 │ vllm/tests/core/test_scheduler.py ......................                 [100%]
 │ ============================== 22 passed in 6.97s ==============================
 │
 ╰─────────────────── (took 72 seconds)
 ✔ all tests passed!

Member

@h-vetinari h-vetinari left a comment


This is still marked as a draft (intentional?), and you'll have to do the procedures to get the rights to the opengpu server, but the PR itself LGTM!

@shermansiu
Contributor

"Only those with write access to this repository can mark a draft pull request as ready for review." - I'm unable to change this!

@h-vetinari h-vetinari marked this pull request as ready for review July 29, 2025 03:40
@shermansiu
Contributor

Please add "Closes #24710" and "fixes #29105" to the initial PR description so that we can close the other vllm PR and the package request issue automatically once this gets merged

Thanks @h-vetinari for adding that in! 😄

@h-vetinari h-vetinari merged commit ee83645 into conda-forge:main Jul 29, 2025
6 of 8 checks passed
@maresb maresb deleted the vllm branch July 30, 2025 05:23
@maresb maresb restored the vllm branch August 17, 2025 10:20