[CI] Add sm_110 to aarch64 CUDA 13.0 builds#31544
Conversation
|
👋 Hi! Thank you for contributing to the vLLM project. 💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels. Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run You ask your reviewers to trigger select CI tests on top of Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging. To run CI, PR reviewers can either: Add If you have any questions, please reach out to us on Slack at https://slack.vllm.ai. 🚀 |
There was a problem hiding this comment.
Code Review
This pull request adds support for sm_110 to the aarch64 CUDA 13.0 build configuration. The change is straightforward, but I've identified a potential inconsistency in how future GPU architectures are specified in the torch_cuda_arch_list. My feedback includes a suggestion to use the +PTX suffix for all future architectures to ensure better forward-compatibility and maintain consistency within the build script.
| # #NOTE: torch_cuda_arch_list is derived from upstream PyTorch build files here: | ||
| # https://github.com/pytorch/pytorch/blob/main/.ci/aarch64_linux/aarch64_ci_build.sh#L7 | ||
| - "DOCKER_BUILDKIT=1 docker build --build-arg max_jobs=16 --build-arg USE_SCCACHE=1 --build-arg GIT_REPO_CHECK=1 --build-arg CUDA_VERSION=13.0.1 --build-arg torch_cuda_arch_list='8.7 8.9 9.0 10.0+PTX 12.0' --build-arg BUILD_BASE_IMAGE=nvidia/cuda:13.0.1-devel-ubuntu22.04 --tag vllm-ci:build-image --target build --progress plain -f docker/Dockerfile ." | ||
| - "DOCKER_BUILDKIT=1 docker build --build-arg max_jobs=16 --build-arg USE_SCCACHE=1 --build-arg GIT_REPO_CHECK=1 --build-arg CUDA_VERSION=13.0.1 --build-arg torch_cuda_arch_list='8.7 8.9 9.0 10.0+PTX 11.0 12.0' --build-arg BUILD_BASE_IMAGE=nvidia/cuda:13.0.1-devel-ubuntu22.04 --tag vllm-ci:build-image --target build --progress plain -f docker/Dockerfile ." |
There was a problem hiding this comment.
For consistency and forward-compatibility, it's best practice to use the +PTX suffix for future GPU architectures. This ensures that only PTX code is generated, which can be JIT-compiled by drivers on future hardware, rather than attempting to generate native SASS code which may not be possible with the current compiler version.
I notice that 10.0 is specified as 10.0+PTX, but the newly added 11.0 and the existing 12.0 lack this suffix. To maintain consistency and follow best practices, I recommend adding +PTX to both 11.0 and 12.0.
- "DOCKER_BUILDKIT=1 docker build --build-arg max_jobs=16 --build-arg USE_SCCACHE=1 --build-arg GIT_REPO_CHECK=1 --build-arg CUDA_VERSION=13.0.1 --build-arg torch_cuda_arch_list='8.7 8.9 9.0 10.0+PTX 11.0+PTX 12.0+PTX' --build-arg BUILD_BASE_IMAGE=nvidia/cuda:13.0.1-devel-ubuntu22.04 --tag vllm-ci:build-image --target build --progress plain -f docker/Dockerfile ."|
This pull request has been automatically marked as stale because it has not had any activity within 90 days. It will be automatically closed if no further activity occurs within 30 days. Leave a comment if you feel this pull request should remain open. Thank you! |
|
Any updates~? |
|
@xs-alt @NebulaTurnip27 I have absorbed it in #39878. |
|
Thank you!! |
|
This pull request has merge conflicts that must be resolved before it can be |
Purpose
This adds sm_110 (11.0) to the torch_cuda_arch_list for the aarch64 CUDA 13.0 wheel build introduced in #30341, ensuring the builds work on Jetson Thor.
Test Plan
building locally for sm110 works and has been working since 0.11.0.