Support Multiple Model initialization by gwarmstrong · Pull Request #1018 · NVIDIA-NeMo/Skills

gwarmstrong · 2025-10-31T22:11:55Z

This PR supports having multiple models initialized in a generate command, like so:


from nemo_skills.pipeline.cli import wrap_arguments, generate

generate(
    ctx=wrap_arguments(""),
    cluster="ord",
    partition="interactive",
    model=["/hf_models/Qwen3-32B", "/hf_models/Qwen3-8B"],
    server_gpus=8,  # Broadcast to both models
    server_nodes=[2,1], # 2 nodes for 32B, 1 node for 8B
    server_type="vllm",
    generation_module="multiturn-conversation/multiturn_conversation.py",
    input_file="/nemo_run/code/multiturn-conversation/sample_prompts.jsonl",
    output_dir=f"/workspace/multiturn-conversation/minimal_example/output",
)

This generation module is not available in the branch (and likely won't due to the specificity of its implementation) but reach out to me if you want the example.

Signed-off-by: George Armstrong <georgea@nvidia.com>

nemo_skills/pipeline/utils/declarative.py

nemo_skills/pipeline/utils/generation.py

nemo_skills/pipeline/generate.py

Signed-off-by: George Armstrong <georgea@nvidia.com>

Kipok

lgtm, thanks! Feel free to merge as long as both gpu and slurm tests are passing

gwarmstrong · 2025-11-21T20:38:59Z

Closed in favor of #1052

gwarmstrong added 17 commits October 31, 2025 13:27

WIP multiturn draft

5f7ad33

Signed-off-by: George Armstrong <georgea@nvidia.com>

WIP fixes and logging

6d9684b

Signed-off-by: George Armstrong <georgea@nvidia.com>

MAINT remove debug and move imports to top

70eac83

Signed-off-by: George Armstrong <georgea@nvidia.com>

ENH better engagement between turns

23eefad

Signed-off-by: George Armstrong <georgea@nvidia.com>

MAINT convert multi generation command interface

16385fd

Signed-off-by: George Armstrong <georgea@nvidia.com>

MAINT fix support for multiturn generation type

5983e60

Signed-off-by: George Armstrong <georgea@nvidia.com>

FIX port passing

676815a

Signed-off-by: George Armstrong <georgea@nvidia.com>

MAINT move the generation module out of nemo-skills

f672c61

Signed-off-by: George Armstrong <georgea@nvidia.com>

MAINT revert to import module

596ba1e

Signed-off-by: George Armstrong <georgea@nvidia.com>

Revert factory

2a6e83e

Signed-off-by: George Armstrong <georgea@nvidia.com>

MAINT cleanup

d175e0a

Signed-off-by: George Armstrong <georgea@nvidia.com>

MAINT cleanup

63ee9f2

Signed-off-by: George Armstrong <georgea@nvidia.com>

MAINT cleanup

3411e26

Signed-off-by: George Armstrong <georgea@nvidia.com>

MAINT add slurm helper

30fa8bb

Signed-off-by: George Armstrong <georgea@nvidia.com>

FIX server config single model

7969706

Signed-off-by: George Armstrong <georgea@nvidia.com>

FIX server config single model

7dea10c

Signed-off-by: George Armstrong <georgea@nvidia.com>

FIX merge indentation error

ea28acb

Signed-off-by: George Armstrong <georgea@nvidia.com>

gwarmstrong mentioned this pull request Oct 31, 2025

Handle side-effects from Command class #937

Closed

gwarmstrong added 4 commits November 3, 2025 15:06

FIX typer option make all list

8c71823

Signed-off-by: George Armstrong <georgea@nvidia.com>

MAINT add copyright

70d07c0

Signed-off-by: George Armstrong <georgea@nvidia.com>

FIX default list parameters

ddcd371

Signed-off-by: George Armstrong <georgea@nvidia.com>

update tests with new helpers

d461275

Signed-off-by: George Armstrong <georgea@nvidia.com>

Kipok reviewed Nov 4, 2025

View reviewed changes

gwarmstrong added 6 commits November 4, 2025 10:31

Merge branch 'main' into georgea/multiturn-conversation

4250714

FIX and DOC model and parameter broadcasting

2bd3852

Signed-off-by: George Armstrong <georgea@nvidia.com>

code review: move to utils, decrease redundancy

da2fdd6

Signed-off-by: George Armstrong <georgea@nvidia.com>

TST update pipeline tests to have functional rather than stateful checks

d31fdd8

Signed-off-by: George Armstrong <georgea@nvidia.com>

MAINT rename with model_idx

5744618

Signed-off-by: George Armstrong <georgea@nvidia.com>

Merge branch 'main' into georgea/multiturn-conversation

bf95bb5

gwarmstrong requested a review from Kipok November 4, 2025 21:45

gwarmstrong added 3 commits November 4, 2025 15:50

Merge branch 'main' into georgea/multiturn-conversation

e9ccfb8

Merge branch 'main' into georgea/multiturn-conversation

2e588e4

Merge branch 'main' into georgea/multiturn-conversation

452fa7d

Kipok added the run GPU tests label Nov 6, 2025

gwarmstrong added 2 commits November 7, 2025 12:51

WIP try to convert multi-model updates to use existing args

baa3d7d

Signed-off-by: George Armstrong <georgea@nvidia.com>

Merge branch 'main' into georgea/multiturn-conversation

494c64f

Signed-off-by: George Armstrong <georgea@nvidia.com>

gwarmstrong added run GPU tests and removed run GPU tests labels Nov 7, 2025

Kipok approved these changes Nov 8, 2025

View reviewed changes

gwarmstrong self-assigned this Nov 10, 2025

Merge branch 'main' into georgea/multiturn-conversation

28554fa

gwarmstrong closed this Nov 21, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support Multiple Model initialization#1018

Support Multiple Model initialization#1018
gwarmstrong wants to merge 33 commits intomainfrom
georgea/multiturn-conversation

gwarmstrong commented Oct 31, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Kipok left a comment

Uh oh!

gwarmstrong commented Nov 21, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

gwarmstrong commented Oct 31, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Kipok left a comment

Choose a reason for hiding this comment

Uh oh!

gwarmstrong commented Nov 21, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants