Skip to content

Commit 7e2521a

Browse files
authored
[None][chore] Some clean-ups for CUDA 13.0 dependencies (#7979)
Signed-off-by: Yanchao Lu <[email protected]>
1 parent 1eb6531 commit 7e2521a

File tree

5 files changed

+11
-12
lines changed

5 files changed

+11
-12
lines changed

docker/common/install_tensorrt.sh

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -95,7 +95,7 @@ install_rockylinux_requirements() {
9595
"cuda-toolkit-config-common-${CUDA_RUNTIME}.noarch" \
9696
"libcublas-${CUBLAS_CUDA_VERSION}-${CUBLAS_VER}.${ARCH1}" \
9797
"libcublas-devel-${CUBLAS_CUDA_VERSION}-${CUBLAS_VER}.${ARCH1}"; do
98-
wget --retry-connrefused --timeout=180 --tries=10 --continue "https://developer.download.nvidia.cn/compute/cuda/repos/rhel8/${ARCH3}/${pkg}.rpm"
98+
wget --retry-connrefused --timeout=180 --tries=10 --continue "https://developer.download.nvidia.com/compute/cuda/repos/rhel8/${ARCH3}/${pkg}.rpm"
9999
done
100100

101101
# Remove old packages

docs/source/legacy/reference/support-matrix.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -152,9 +152,9 @@ The following table shows the supported software for TensorRT-LLM.
152152
* -
153153
- Software Compatibility
154154
* - Container
155-
- [25.06](https://docs.nvidia.com/deeplearning/frameworks/support-matrix/index.html)
155+
- [25.08](https://docs.nvidia.com/deeplearning/frameworks/support-matrix/index.html)
156156
* - TensorRT
157-
- [10.11](https://docs.nvidia.com/deeplearning/tensorrt/release-notes/index.html)
157+
- [10.13](https://docs.nvidia.com/deeplearning/tensorrt/release-notes/index.html)
158158
* - Precision
159159
-
160160
- Blackwell (SM100/SM120) - FP32, FP16, BF16, FP8, FP4, INT8, INT4

jenkins/L0_Test.groovy

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -2262,7 +2262,7 @@ def launchTestJobs(pipeline, testFilter)
22622262
"B200_PCIe",
22632263
X86_64_TRIPLE,
22642264
false,
2265-
"dlfw/",
2265+
"cuda13/",
22662266
DLFW_IMAGE,
22672267
false,
22682268
],
@@ -2301,7 +2301,7 @@ def launchTestJobs(pipeline, testFilter)
23012301
"GH200",
23022302
AARCH64_TRIPLE,
23032303
false,
2304-
"dlfw/",
2304+
"cuda13/",
23052305
DLFW_IMAGE,
23062306
false,
23072307
],

requirements.txt

Lines changed: 5 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ accelerate>=1.7.0
44
build
55
colored
66
# cuda-python>=12,<13 # <For CUDA 12.9>
7-
cuda-python>=12
7+
cuda-python>=13
88
diffusers>=0.27.0
99
lark
1010
mpi4py
@@ -15,23 +15,22 @@ openai
1515
polygraphy
1616
psutil
1717
# nvidia-ml-py>=12,<13 # <For CUDA 12.9>
18-
nvidia-ml-py>=12
19-
# Just a wrapper since nvidia-modelopt requires pynvml
20-
pynvml==12.0.0
18+
nvidia-ml-py>=13
2119
pulp
2220
pandas
2321
h5py==3.12.1
2422
StrEnum
2523
sentencepiece>=0.1.99
26-
# tensorrt>=10.11.0,<=10.13.0 # <For CUDA 12.9>
24+
# tensorrt~=10.11.0 # <For CUDA 12.9>
2725
tensorrt~=10.13.0
2826
# https://docs.nvidia.com/deeplearning/frameworks/pytorch-release-notes/rel-25-08.html#rel-25-08 uses 2.8.0a0.
2927
# torch>=2.7.1,<=2.8.0a0 # <For CUDA 12.9>
3028
torch>=2.8.0a0,<=2.8.0
3129
torchvision
3230
nvidia-modelopt[torch]~=0.33.0
3331
# https://docs.nvidia.com/deeplearning/frameworks/pytorch-release-notes/rel-25-08.html#rel-25-08 uses 2.27.7
34-
nvidia-nccl-cu12
32+
# nvidia-nccl-cu12 # <For CUDA 12.9>
33+
nvidia-nccl-cu13
3534
# nvidia-cuda-nvrtc-cu12 # <For CUDA 12.9>
3635
nvidia-cuda-nvrtc
3736
transformers==4.56.0

tensorrt_llm/auto_parallel/cluster_info.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -176,7 +176,7 @@ class ClusterInfo(DictConversion):
176176
float32=60,
177177
),
178178
),
179-
# from https://images.nvidia.cn/content/Solutions/data-center/a40/nvidia-a40-datasheet.pdf
179+
# from https://images.nvidia.com/content/Solutions/data-center/a40/nvidia-a40-datasheet.pdf
180180
"A40":
181181
ClusterInfo(
182182
intra_node_bw_per_device=_bandwidths["PCIe-4"],

0 commit comments

Comments
 (0)