Skip to content

Commit 751f638

Browse files
saturley-halljasonqinzhou
authored andcommitted
fix: update the tags for consistency and remove 0.4.1 refs (#3058)
Signed-off-by: Harrison King Saturley-Hall <[email protected]> Signed-off-by: Harrison Saturley-Hall <[email protected]> Signed-off-by: Jason Zhou <[email protected]>
1 parent b85c41e commit 751f638

File tree

43 files changed

+98
-98
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

43 files changed

+98
-98
lines changed

benchmarks/incluster/benchmark_job.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -17,7 +17,7 @@ spec:
1717
fsGroup: 1000
1818
containers:
1919
- name: benchmark-runner
20-
image: nvcr.io/nvidia/ai-dynamo/vllm-runtime:0.5.0
20+
image: nvcr.io/nvidia/ai-dynamo/vllm-runtime:my-tag
2121
securityContext:
2222
allowPrivilegeEscalation: false
2323
capabilities:

benchmarks/nixl/nixl-benchmark-deployment.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -18,7 +18,7 @@ spec:
1818
- name: nvcr-imagepullsecret
1919
containers:
2020
- name: nixl-benchmark
21-
image: nvcr.io/nvidian/nim-llm-dev/vllm-runtime:nixlbench-e42c07a8
21+
image: my-registry/vllm-runtime:nixlbench-e42c07a8
2222
command: ["sh", "-c"]
2323
args:
2424
- "nixlbench -etcd_endpoints http://dynamo-platform-etcd:2379 --target_seg_type VRAM --initiator_seg_type VRAM && sleep infinity"

components/backends/sglang/README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -130,7 +130,7 @@ uv pip install --prerelease=allow sglang[all]==0.4.9.post6
130130
<summary>Instructions</summary>
131131

132132
```bash
133-
docker pull nvcr.io/nvidia/ai-dynamo/sglang-runtime:0.3.2
133+
docker pull nvcr.io/nvidia/ai-dynamo/sglang-runtime:my-tag
134134
```
135135

136136
</details>

components/backends/sglang/deploy/README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -92,7 +92,7 @@ Edit the template to match your environment:
9292

9393
```yaml
9494
# Update image registry and tag
95-
image: your-registry/sglang-runtime:your-tag
95+
image: my-registry/sglang-runtime:my-tag
9696

9797
# Configure your model
9898
args:

components/backends/sglang/deploy/disagg_planner.yaml

Lines changed: 5 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -18,7 +18,7 @@ spec:
1818
replicas: 1
1919
extraPodSpec:
2020
mainContainer:
21-
image: nvcr.io/nvidian/nim-llm-dev/sglang-runtime:hzhou-0811-1
21+
image: my-registry/sglang-runtime:my-tag
2222
Planner:
2323
dynamoNamespace: dynamo
2424
envFromSecret: hf-token-secret
@@ -49,7 +49,7 @@ spec:
4949
mountPoint: /data
5050
extraPodSpec:
5151
mainContainer:
52-
image: nvcr.io/nvidian/nim-llm-dev/sglang-runtime:hzhou-0811-1
52+
image: my-registry/sglang-runtime:my-tag
5353
workingDir: /workspace/components/planner/src/dynamo/planner
5454
command:
5555
- /bin/sh
@@ -89,7 +89,7 @@ spec:
8989
failureThreshold: 10
9090
extraPodSpec:
9191
mainContainer:
92-
image: nvcr.io/nvidian/nim-llm-dev/sglang-runtime:hzhou-0811-1
92+
image: my-registry/sglang-runtime:my-tag
9393
workingDir: /workspace/components/backends/sglang
9494
command:
9595
- /bin/sh
@@ -106,7 +106,7 @@ spec:
106106
gpu: "1"
107107
extraPodSpec:
108108
mainContainer:
109-
image: nvcr.io/nvidian/nim-llm-dev/sglang-runtime:hzhou-0811-1
109+
image: my-registry/sglang-runtime:my-tag
110110
workingDir: /workspace/components/backends/sglang
111111
command:
112112
- python3
@@ -137,7 +137,7 @@ spec:
137137
gpu: "1"
138138
extraPodSpec:
139139
mainContainer:
140-
image: nvcr.io/nvidian/nim-llm-dev/sglang-runtime:hzhou-0811-1
140+
image: my-registry/sglang-runtime:my-tag
141141
workingDir: /workspace/components/backends/sglang
142142
command:
143143
- python3

components/backends/trtllm/deploy/README.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -89,7 +89,7 @@ resources:
8989
```yaml
9090
extraPodSpec:
9191
mainContainer:
92-
image: nvcr.io/nvidian/nim-llm-dev/trtllm-runtime:dep-233.17
92+
image: my-registry/trtllm-runtime:my-tag
9393
workingDir: /workspace/components/backends/trtllm
9494
args:
9595
- "python3"
@@ -109,7 +109,7 @@ Before using these templates, ensure you have:
109109

110110
### Container Images
111111

112-
The deployment files currently require access to `nvcr.io/nvidian/nim-llm-dev/trtllm-runtime`. If you don't have access, build and push your own image:
112+
The deployment files currently require access to `my-registry/trtllm-runtime`. If you don't have access, build and push your own image:
113113

114114
```bash
115115
./container/build.sh --framework tensorrtllm
@@ -141,7 +141,7 @@ Edit the template to match your environment:
141141

142142
```yaml
143143
# Update image registry and tag
144-
image: your-registry/trtllm-runtime:your-tag
144+
image: my-registry/trtllm-runtime:my-tag
145145

146146
# Configure your model and deployment settings
147147
args:

components/backends/trtllm/deploy/agg-with-config.yaml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -34,7 +34,7 @@ spec:
3434
replicas: 1
3535
extraPodSpec:
3636
mainContainer:
37-
image: nvcr.io/nvidia/ai-dynamo/tensorrtllm-runtime:0.4.1
37+
image: nvcr.io/nvidia/ai-dynamo/tensorrtllm-runtime:my-tag
3838
TRTLLMWorker:
3939
envFromSecret: hf-token-secret
4040
dynamoNamespace: trtllm-agg
@@ -50,7 +50,7 @@ spec:
5050
configMap:
5151
name: nvidia-config
5252
mainContainer:
53-
image: nvcr.io/nvidia/ai-dynamo/tensorrtllm-runtime:0.4.1
53+
image: nvcr.io/nvidia/ai-dynamo/tensorrtllm-runtime:my-tag
5454
workingDir: /workspace/components/backends/trtllm
5555
# mount the configmap as a volume
5656
volumeMounts:

components/backends/trtllm/deploy/agg.yaml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -13,7 +13,7 @@ spec:
1313
replicas: 1
1414
extraPodSpec:
1515
mainContainer:
16-
image: nvcr.io/nvidian/nim-llm-dev/trtllm-runtime:dep-233.17
16+
image: my-registry/trtllm-runtime:my-tag
1717
TRTLLMWorker:
1818
envFromSecret: hf-token-secret
1919
dynamoNamespace: trtllm-agg
@@ -24,7 +24,7 @@ spec:
2424
gpu: "1"
2525
extraPodSpec:
2626
mainContainer:
27-
image: nvcr.io/nvidian/nim-llm-dev/trtllm-runtime:dep-233.17
27+
image: my-registry/trtllm-runtime:my-tag
2828
workingDir: /workspace/components/backends/trtllm
2929
command:
3030
- /bin/sh

components/backends/trtllm/deploy/agg_router.yaml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -13,7 +13,7 @@ spec:
1313
replicas: 1
1414
extraPodSpec:
1515
mainContainer:
16-
image: nvcr.io/nvidian/nim-llm-dev/trtllm-runtime:dep-233.17
16+
image: my-registry/trtllm-runtime:my-tag
1717
envs:
1818
- name: DYN_ROUTER_MODE
1919
value: kv
@@ -27,7 +27,7 @@ spec:
2727
gpu: "1"
2828
extraPodSpec:
2929
mainContainer:
30-
image: nvcr.io/nvidian/nim-llm-dev/trtllm-runtime:dep-233.17
30+
image: my-registry/trtllm-runtime:my-tag
3131
workingDir: /workspace/components/backends/trtllm
3232
command:
3333
- /bin/sh

components/backends/trtllm/deploy/disagg.yaml

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -13,7 +13,7 @@ spec:
1313
replicas: 1
1414
extraPodSpec:
1515
mainContainer:
16-
image: nvcr.io/nvidian/nim-llm-dev/trtllm-runtime:dep-233.17
16+
image: my-registry/trtllm-runtime:my-tag
1717
TRTLLMPrefillWorker:
1818
dynamoNamespace: trtllm-disagg
1919
envFromSecret: hf-token-secret
@@ -24,7 +24,7 @@ spec:
2424
gpu: "1"
2525
extraPodSpec:
2626
mainContainer:
27-
image: nvcr.io/nvidian/nim-llm-dev/trtllm-runtime:dep-233.17
27+
image: my-registry/trtllm-runtime:my-tag
2828
workingDir: /workspace/components/backends/trtllm
2929
command:
3030
- /bin/sh
@@ -41,7 +41,7 @@ spec:
4141
gpu: "1"
4242
extraPodSpec:
4343
mainContainer:
44-
image: nvcr.io/nvidian/nim-llm-dev/trtllm-runtime:dep-233.17
44+
image: my-registry/trtllm-runtime:my-tag
4545
workingDir: /workspace/components/backends/trtllm
4646
command:
4747
- /bin/sh

0 commit comments

Comments
 (0)