Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
48 commits
Select commit Hold shift + click to select a range
14e9db7
Update nemo-rl to latest
smahdavi4 Dec 9, 2025
ee566dc
Update start_grpo
smahdavi4 Dec 10, 2025
33cf653
legacy configs for nemo-rl
smahdavi4 Dec 10, 2025
f00cc8e
update dockerfile
smahdavi4 Dec 10, 2025
cf9c856
update nemo-rl to latest commit
smahdavi4 Jan 15, 2026
53a9e93
add one more comment
smahdavi4 Jan 15, 2026
b8c0aaf
Merge branch 'main' of github.com:NVIDIA-NeMo/Skills into smahdavi/ne…
smahdavi4 Jan 15, 2026
6dc97b8
Remove bfcl from llama nemotron tests
Kipok Jan 26, 2026
9379b25
Fix qwen3 bfcl test
Kipok Jan 26, 2026
4fa081e
Fix for clone and run
Kipok Jan 26, 2026
ab6e055
Fix bfcl and scicode prepare
Kipok Jan 26, 2026
1cc7892
Tmp change
Kipok Jan 26, 2026
86d9c71
Fix bfcl check results
Kipok Jan 26, 2026
9fedaf2
Update constraint
Kipok Jan 26, 2026
b1c56fb
Fix bfcl test
Kipok Jan 27, 2026
d670528
Fix problematic requirements in bfcl
Kipok Jan 27, 2026
d6dc7dd
Remove sentence transformers dep
Kipok Jan 27, 2026
d4e2ae4
Remove torch dep
Kipok Jan 27, 2026
2c7a3c4
Fixing installation problems
Kipok Jan 27, 2026
761c584
Update containers
Kipok Jan 27, 2026
2e99c71
Merge remote-tracking branch 'origin/smahdavi/nemo-rl-update' into ig…
Kipok Jan 27, 2026
673bb18
Merge branch 'igitman/bfcl-req-fixes' into igitman/sglang-update
Kipok Jan 27, 2026
c29dc37
Merge branch 'igitman/slurm-test-fixes' into igitman/sglang-update
Kipok Jan 27, 2026
95b50a1
Remove legacy and rollback grpo configs
smahdavi4 Jan 27, 2026
502df62
Remove legacy and rollback grpo configs
smahdavi4 Jan 27, 2026
b255a2d
Merge remote-tracking branch 'origin/smahdavi/nemo-rl-update' into ig…
Kipok Jan 27, 2026
8315af2
Update gsm-plus to hf api
Kipok Jan 27, 2026
68bd2e8
Merge branch 'igitman/slurm-test-fixes' into igitman/sglang-update
Kipok Jan 27, 2026
02f5e89
Update conversion script
Kipok Jan 28, 2026
5113e64
Merge branch 'smahdavi/nemo-rl-update' into igitman/sglang-update
Kipok Jan 28, 2026
394f177
Tmp change
Kipok Jan 28, 2026
0ee9c77
Adjust test for warmup
Kipok Jan 28, 2026
9c38000
Merge branch 'smahdavi/nemo-rl-update' into igitman/sglang-update
Kipok Jan 28, 2026
6dcbd91
Switch to a proper conversion script
Kipok Jan 28, 2026
487f0dd
Fix bfcl test
Kipok Jan 28, 2026
268f188
Merge branch 'igitman/slurm-test-fixes' into igitman/sglang-update
Kipok Jan 28, 2026
4f4881d
Remove unused parameter
Kipok Jan 28, 2026
a977a5f
Merge branch 'main' into smahdavi/nemo-rl-update
Kipok Jan 28, 2026
265228e
Fix for import
Kipok Jan 28, 2026
4a6833d
Merge branch 'smahdavi/nemo-rl-update' of https://github.com/NVIDIA/N…
Kipok Jan 28, 2026
d8266f4
Merge branch 'smahdavi/nemo-rl-update' into igitman/sglang-update
Kipok Jan 28, 2026
4192d30
Add extra automodel
Kipok Jan 28, 2026
68e1806
Merge branch 'smahdavi/nemo-rl-update' into igitman/sglang-update
Kipok Jan 28, 2026
9804714
Add copy for tokenizer files
Kipok Jan 28, 2026
23584c3
Merge branch 'smahdavi/nemo-rl-update' into igitman/sglang-update
Kipok Jan 28, 2026
51443e4
Roll-back bad change
Kipok Jan 29, 2026
457b75f
Adjust constraints
Kipok Jan 29, 2026
ed54624
Merge branch 'main' into igitman/sglang-update
Kipok Jan 29, 2026
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
6 changes: 3 additions & 3 deletions cluster_configs/example-local.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -15,9 +15,9 @@
executor: local

containers:
trtllm: nvcr.io/nvidia/tensorrt-llm/release:1.0.0
vllm: vllm/vllm-openai:v0.10.1.1
sglang: lmsysorg/sglang:v0.5.4
trtllm: nvcr.io/nvidia/tensorrt-llm/release:1.3.0rc1
vllm: dockerfile:dockerfiles/Dockerfile.vllm
sglang: lmsysorg/sglang:v0.5.8
# dockerfile: for now can only specify relative to repo root
megatron: dockerfile:dockerfiles/Dockerfile.megatron
sandbox: dockerfile:dockerfiles/Dockerfile.sandbox
Expand Down
2 changes: 1 addition & 1 deletion dockerfiles/Dockerfile.vllm
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
FROM vllm/vllm-openai:v0.14.0
FROM vllm/vllm-openai:v0.14.1
RUN pip install "vllm[audio]"
# Required by vLLM for Qwen-VL model family (runtime dependency, not directly imported)
RUN pip install qwen-vl-utils
8 changes: 2 additions & 6 deletions dockerfiles/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -28,12 +28,8 @@ set `DOCKER_PLATFORM=linux/arm64` for the build script described above.

## Building trtllm image

We directly use official `nvcr.io/nvidia/tensorrt-llm/release:1.0.0` image for both amd64 and arm64.
We directly use official `nvcr.io/nvidia/tensorrt-llm/release:1.3.0rc1` image.

## Building sglang image

We directly use official `lmsysorg/sglang:v0.5.4` image.

## Building vllm image

We use official `vllm/vllm-openai:v0.10.2` image with the additional `vllm[audio]` dependencies.
We directly use official `lmsysorg/sglang:v0.5.8` image.
4 changes: 2 additions & 2 deletions nemo_skills/__init__.py
Original file line number Diff line number Diff line change
Expand Up @@ -16,9 +16,9 @@

# only used in ns setup command to initialize with defaults
_containers = {
"trtllm": "nvcr.io/nvidia/tensorrt-llm/release:1.0.0",
"trtllm": "nvcr.io/nvidia/tensorrt-llm/release:1.3.0rc1",
"vllm": "dockerfile:dockerfiles/Dockerfile.vllm",
"sglang": "lmsysorg/sglang:v0.5.4",
"sglang": "lmsysorg/sglang:v0.5.8",
"megatron": "dockerfile:dockerfiles/Dockerfile.megatron",
"sandbox": "dockerfile:dockerfiles/Dockerfile.sandbox",
"nemo-skills": "dockerfile:dockerfiles/Dockerfile.nemo-skills",
Expand Down
4 changes: 2 additions & 2 deletions tests/gpu-tests/test-local.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -15,9 +15,9 @@
executor: local

containers:
trtllm: nvcr.io/nvidia/tensorrt-llm/release:1.0.0
trtllm: nvcr.io/nvidia/tensorrt-llm/release:1.3.0rc1
vllm: dockerfile:dockerfiles/Dockerfile.vllm
sglang: lmsysorg/sglang:v0.5.4
sglang: lmsysorg/sglang:v0.5.8
sandbox: dockerfile:dockerfiles/Dockerfile.sandbox
nemo-skills: dockerfile:dockerfiles/Dockerfile.nemo-skills
megatron: dockerfile:dockerfiles/Dockerfile.megatron
Expand Down
6 changes: 3 additions & 3 deletions tests/slurm-tests/qwen3_4b_evals/check_results.py
Original file line number Diff line number Diff line change
Expand Up @@ -24,13 +24,13 @@
TOOLCALLING_METRIC_RANGES = {
("overall_accuracy", "accuracy"): (61.0, 67.0),
("overall_non_live", "accuracy"): (84.0, 90.0),
("non_live_ast", "accuracy"): (85.0, 92.0),
("non_live_ast", "accuracy"): (84.0, 92.0),
("non_live_irrelevance", "accuracy"): (79.0, 86.0),
("overall_live", "accuracy"): (76.0, 83.0),
("live_ast", "accuracy"): (79.0, 86.0),
("live_irrelevance", "accuracy"): (73.0, 80.0),
("live_relevance", "accuracy"): (70.0, 90.0), # unusually high variance
("overall_multi_turn", "accuracy"): (20.0, 30.0),
("live_relevance", "accuracy"): (70.0, 100.0), # unusually high variance
("overall_multi_turn", "accuracy"): (20.0, 33.0),
}


Expand Down