-
Notifications
You must be signed in to change notification settings - Fork 3.4k
chore(gb200): update dockerfile to handle fp4 disaggregation #8694
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Changes from all commits
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
| Original file line number | Diff line number | Diff line change |
|---|---|---|
|
|
@@ -53,12 +53,10 @@ RUN mkdir -p /tmp/gdrcopy && cd /tmp \ | |
| RUN ln -sf /usr/lib/$(uname -m)-linux-gnu/libmlx5.so.1 /usr/lib/$(uname -m)-linux-gnu/libmlx5.so | ||
|
|
||
| # Clone and install SGLang | ||
| # NOTE: flashinfer v0.2.9rc1 is not installing for aarch64 | ||
| WORKDIR /sgl-workspace | ||
| RUN python3 -m pip install --no-cache-dir --upgrade pip setuptools wheel html5lib six \ | ||
| && git clone https://github.com/sgl-project/sglang.git \ | ||
| && git clone --depth 1 https://github.com/sgl-project/sglang.git \ | ||
| && cd sglang \ | ||
| && git checkout a167fd0bcb9ef4b0f4331a109e40c8cdc770b026 \ | ||
| && case "$CUDA_VERSION" in \ | ||
| 12.6.1) CUINDEX=126 ;; \ | ||
| 12.8.1) CUINDEX=128 ;; \ | ||
|
|
@@ -93,7 +91,7 @@ RUN wget https://developer.download.nvidia.com/compute/redist/nvshmem/3.3.9/sour | |
| # Python tools | ||
| RUN python3 -m pip install --no-cache-dir \ | ||
| datamodel_code_generator \ | ||
| mooncake_transfer_engine==0.3.5 \ | ||
| mooncake-transfer-engine==0.3.5 \ | ||
| pre-commit \ | ||
| pytest \ | ||
| black \ | ||
|
|
@@ -103,6 +101,15 @@ RUN python3 -m pip install --no-cache-dir \ | |
| wheel \ | ||
| scikit-build-core | ||
|
|
||
| # These will be automatically installed by future versions of flashinfer after 0.2.9rc2 | ||
| RUN python3 -m pip install --no-cache-dir \ | ||
| nvidia-cudnn-cu12 \ | ||
| nvidia-cudnn-frontend | ||
|
|
||
| # Allows for FP4 disaggregation | ||
| RUN python3 -m pip install --no-cache-dir \ | ||
| nixl | ||
|
|
||
|
Comment on lines
+104
to
+112
Contributor
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. To optimize the Docker image size and improve build performance, it's a best practice to combine consecutive For further optimization, you could also merge these new packages into the preceding |
||
| # Install development tools and utilities | ||
| RUN apt-get update && apt-get install -y \ | ||
| gdb \ | ||
|
|
||
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The
git checkoutto a specific commit has been removed. While using--depth 1is a good optimization for reducing the clone size, removing the pinned commit means this Docker build will now pull theHEADof thesglangrepository's default branch. This can lead to non-reproducible builds and may introduce breaking changes unexpectedly.For production stability and reproducible builds, it is highly recommended to pin dependencies to a specific git commit hash or tag.