feat: use generic image and use single node for oss-gpt-120b recipe by biswapanda · Pull Request #3454 · ai-dynamo/dynamo

biswapanda · 2025-10-07T06:32:14Z

Overview:

use generic image
use single node
fix hub path

Details:

Where should the reviewer start?

Related Issues: (use one of the action keywords Closes / Fixes / Resolves / Relates to)

closes GitHub issue: #xxx

Summary by CodeRabbit

Bug Fixes
- Corrected model path to ensure models load reliably.
Chores
- Switched container images to a private registry for improved deployment consistency.
- Adjusted default replica count to a single instance for streamlined deployments.
- Tuned performance configuration to use fewer GPUs, aligning perf runs with typical resource availability.

coderabbitai · 2025-10-07T06:36:17Z

Walkthrough

Updated TRT-LLM deployment configs: switched container image registry/tag, reduced replicas, adjusted MODEL_PATH prefix, and scaled down GPU count in perf settings. No new features or API changes.

Changes

Cohort / File(s)	Summary of Changes
TRT-LLM deployment configs `recipes/gpt-oss-120b/trtllm/agg/deploy.yaml`	Changed image from `nvcr.io/nvidia/ai-dynamo/tensorrtllm-runtime:0.5.1-rc0.pre3` to `my-registry/trtllm-runtime:my-tag` in three containers; reduced replicas 18→1 across three sections; updated `MODEL_PATH` from `/model-store/models--openai--gpt-oss-120b/...` to `/model-store/hub/models--openai--gpt-oss-120b/...` in two places.
Perf scaling config `recipes/gpt-oss-120b/trtllm/agg/perf.yaml`	Reduced `DEPLOYMENT_GPU_COUNT` from "72" to "4", affecting derived concurrency and perf inputs; no other logic/path changes.

Estimated code review effort

🎯 2 (Simple) | ⏱️ ~10 minutes

Poem

I thump my paws—new tags, fewer crews,
One GPU burrow, not seventy-two’s.
Paths hop to “hub,” replicas rest,
Leaner carrots, same warm nest.
In quiet racks, I twitch with glee—
Configs trimmed, swift as a bunny.

Pre-merge checks

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Description Check	⚠️ Warning	The pull request description includes the Overview and Related Issues sections but leaves the Details and Where should the reviewer start sections as unmodified placeholders, providing no concrete information about the specific changes or files that need attention, which makes the description incomplete for effective review.	Please populate the Details section with a clear summary of the file changes and update the Where should the reviewer start section to call out the specific files or areas that require focused review.

✅ Passed checks (2 passed)

Check name	Status	Explanation
Title Check	✅ Passed	The pull request title clearly and concisely summarizes the main changes by indicating the switch to a generic image and reducing to a single node for the oss-gpt-120b recipe, directly reflecting the core updates in the deployment configurations. It uses specific terminology and avoids vague language, making it easy for reviewers to grasp the primary intent at a glance. The title is appropriately scoped and aligned with the changeset.
Docstring Coverage	✅ Passed	No functions found in the changes. Docstring coverage check skipped.

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

…3454)

Signed-off-by: Biswa Panda <biswa.panda@gmail.com>

…3454) Signed-off-by: Piotr Tarasiewicz <ptarasiewicz@nvidia.com>

…3454)

…i-dynamo#3454)

Cosmos3 pipelines are only in the unreleased vllm-omni PR vllm-project/vllm-omni#3454, not in any released wheel. Re-enable the git-install mechanism (reverted in 7744835) so the vllm-runtime container installs vllm-omni from the canonical repo pinned to the current PR head SHA (65b83d87, == refs/pull/3454/head). When vllm_omni_git_url is set, install_vllm_omni.sh installs "vllm-omni @ git+<url>@<ref>"; otherwise it falls back to the released "vllm-omni==<ref>" wheel. Signed-off-by: Harrison King Saturley-Hall <hsaturleyhal@nvidia.com> Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>

feat: use generic image and use single node

7448483

biswapanda self-assigned this Oct 7, 2025

biswapanda requested a review from a team as a code owner October 7, 2025 06:32

biswapanda requested a review from a team October 7, 2025 06:32

biswapanda requested a review from a team as a code owner October 7, 2025 06:32

pull-request-size Bot added the size/S label Oct 7, 2025

github-actions Bot added the feat label Oct 7, 2025

biswapanda enabled auto-merge (squash) October 7, 2025 17:16

tmonty12 approved these changes Oct 7, 2025

View reviewed changes

biswapanda merged commit af7a41c into main Oct 7, 2025
20 checks passed

biswapanda deleted the bis/oss-gpt-120b-1node branch October 7, 2025 17:25

biswapanda added a commit that referenced this pull request Oct 7, 2025

feat: use generic image and use single node for oss-gpt-120b recipe (#…

975a071

…3454)

biswapanda mentioned this pull request Oct 7, 2025

feat: update gpt-oss 120b model recipe #3143 #3454 #3431

Merged

saturley-hall pushed a commit that referenced this pull request Oct 7, 2025

feat: update gpt-oss 120b model recipe #3143 #3454 (#3431)

99f1696

Signed-off-by: Biswa Panda <biswa.panda@gmail.com>

ptarasiewiczNV pushed a commit that referenced this pull request Oct 8, 2025

feat: use generic image and use single node for oss-gpt-120b recipe (#…

b67a9ac

…3454) Signed-off-by: Piotr Tarasiewicz <ptarasiewicz@nvidia.com>

nv-tusharma pushed a commit that referenced this pull request Oct 20, 2025

feat: use generic image and use single node for oss-gpt-120b recipe (#…

fc22656

…3454)

yao531441 pushed a commit to yao531441/dynamo that referenced this pull request May 13, 2026

feat: use generic image and use single node for oss-gpt-120b recipe (a…

6270c4b

…i-dynamo#3454)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: use generic image and use single node for oss-gpt-120b recipe#3454

feat: use generic image and use single node for oss-gpt-120b recipe#3454
biswapanda merged 1 commit into
mainfrom
bis/oss-gpt-120b-1node

biswapanda commented Oct 7, 2025 •

edited

Loading

Uh oh!

coderabbitai Bot commented Oct 7, 2025 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

biswapanda commented Oct 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Overview:

Details:

Where should the reviewer start?

Related Issues: (use one of the action keywords Closes / Fixes / Resolves / Relates to)

Summary by CodeRabbit

Uh oh!

coderabbitai Bot commented Oct 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Estimated code review effort

Poem

Pre-merge checks

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

biswapanda commented Oct 7, 2025 •

edited

Loading

coderabbitai Bot commented Oct 7, 2025 •

edited

Loading