-
Notifications
You must be signed in to change notification settings - Fork 173
Pull requests: NVIDIA-NeMo/RL
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
feat: Support top-p and top-k
CI:L1
Run doctests, unit tests, and functional tests
#1578
opened Nov 27, 2025 by
zhandaz
Loading…
3 of 4 tasks
fix: Fix the sequence padding for FP8 case
CI:L2
Run doctests, unit tests, functional tests, and convergence tests
#1569
opened Nov 25, 2025 by
guyueh1
Loading…
4 tasks
feat: plot vllm internal metrics to the wandb log
CI:L1
Run doctests, unit tests, and functional tests
ease of use
#1567
opened Nov 25, 2025 by
youngeunkwon0405
Loading…
4 tasks
chore: Bump vllm to 0.11.2, torch to 2.9, transformers to 4.57.1
CI:L1
Run doctests, unit tests, and functional tests
docs: Create performance-summary.md for NeMo RL
documentation
Improvements or additions to documentation
#1560
opened Nov 24, 2025 by
snowmanwwg
Loading…
fix: fix Dtensor sharding error when bump up pytorch version
#1557
opened Nov 21, 2025 by
ZhiyuLi-Nvidia
Loading…
4 tasks
fix: remove sft-qwen2.5-fsdp2tp8sp from nighlies
CI:L0
Run doctests and unit tests
#1555
opened Nov 20, 2025 by
ahmadki
Loading…
fix: add H200 TFLOPS
CI:L0
Run doctests and unit tests
community-request
#1543
opened Nov 19, 2025 by
clumsy
Loading…
4 tasks done
feat: refactor dtensor policy v2 into core modular functions
#1542
opened Nov 19, 2025 by
hemildesai
•
Draft
4 tasks
fix: Use Float16Module even when defer_fp32_logits=True
CI:L1
Run doctests, unit tests, and functional tests
#1537
opened Nov 18, 2025 by
yfw
Loading…
4 tasks
feat: Automodel init for DTensorPolicyV2
CI:L2
Run doctests, unit tests, functional tests, and convergence tests
#1509
opened Nov 12, 2025 by
adil-a
Loading…
refactor: refactor env and data processor & add nemotron super 49b recipes
documentation
Improvements or additions to documentation
#1506
opened Nov 11, 2025 by
yuki-97
Loading…
build: Use dynamic engine for generate.
CI:L1
Run doctests, unit tests, and functional tests
#1502
opened Nov 11, 2025 by
shanmugamr1992
Loading…
4 tasks
feat: pipeline-rl style # of inflight prompt regulation
CI:L1
Run doctests, unit tests, and functional tests
documentation
Improvements or additions to documentation
#1499
opened Nov 10, 2025 by
youngeunkwon0405
Loading…
4 tasks
fix: Support vLLM DP+EP in async engine via Ray-level data parallelism
community-request
#1495
opened Nov 10, 2025 by
clumsy
Loading…
4 tasks done
feat: allow uv-less execution and fingerprint the environment
CI:L1
Run doctests, unit tests, and functional tests
CI
Relating to CI
documentation
Improvements or additions to documentation
#1491
opened Nov 9, 2025 by
terrykong
Loading…
fix: Megatron static inference and adapt to mcore engine API changes
CI:L1
Run doctests, unit tests, and functional tests
r0.4.0
#1488
opened Nov 7, 2025 by
shanmugamr1992
Loading…
4 tasks
Previous Next
ProTip!
Find all pull requests that aren't related to any open issues with -linked:issue.