Skip to content

Commit d0cff4c

Browse files
committed
Update comment and ref for vllm
1 parent 1ed877f commit d0cff4c

File tree

3 files changed

+3
-7
lines changed

3 files changed

+3
-7
lines changed

container/Dockerfile.vllm

Lines changed: 2 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -16,10 +16,8 @@ ARG RUNTIME_IMAGE_TAG="12.8.1-runtime-ubuntu24.04"
1616
ARG VLLM_REF="77a6bf07aedf132aad2b6719f6d87abc5d3311ab"
1717
ARG TORCH_BACKEND="cu128"
1818

19-
# Match 0.10.0 vLLM release
20-
# https://github.com/vllm-project/vllm/releases/tag/v0.10.0
21-
# Pinned to commit before https://github.com/deepseek-ai/DeepGEMM/pull/112 for DeepGEMM which seems to break on H100:
22-
# "RuntimeError: Failed: CUDA runtime error csrc/jit/kernel_runtime.hpp:108 '98'"
19+
# Match 0.10.1 vLLM release
20+
# https://github.com/vllm-project/vllm/releases/tag/v0.10.1
2321
ARG DEEPGEMM_REF="f85ec64"
2422
ARG FLASHINF_REF="v0.2.8rc1"
2523

container/deps/vllm/install_vllm.sh

Lines changed: 0 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -175,8 +175,6 @@ git checkout $DEEPGEMM_REF # Pin Version
175175
sed -i 's|[email protected]:|https://github.com/|g' .gitmodules
176176
git submodule sync --recursive
177177
git submodule update --init --recursive
178-
179-
# command for 03d0be3
180178
python setup.py install
181179

182180
# new install command for post 03d0be3

pyproject.toml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -56,7 +56,7 @@ trtllm =[
5656
vllm = [
5757
"uvloop",
5858
"nixl<=0.4.1",
59-
"vllm==0.10.0",
59+
{ git = "https://github.com/vllm-project/vllm.git", rev = "77a6bf07aedf132aad2b6719f6d87abc5d3311ab" },
6060
]
6161

6262
sglang = [

0 commit comments

Comments
 (0)