Skip to content

Pull requests: NVIDIA-NeMo/RL

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

feat: per-worker active/idle timeline + IFB size logging CI:L1 Run doctests, unit tests, and functional tests
#1534 opened Nov 18, 2025 by youngeunkwon0405 Draft
4 tasks
feat: Support qwen3-next, mcore path
#1530 opened Nov 17, 2025 by ahmadki Loading…
1 task
feat: force on-policy ratio to 1
#1529 opened Nov 17, 2025 by yfw Draft
4 tasks
perf: perf script change for qwen30b-a3b CI:docs Run doctest
#1526 opened Nov 15, 2025 by youngeunkwon0405 Loading…
4 tasks
feat: RL sampler [WIP]
#1522 opened Nov 14, 2025 by pjin-nvidia Draft
4 tasks
feat: Add moe load balancing metrics
#1520 opened Nov 13, 2025 by yfw Draft
4 tasks
feat: refactor dtensor policy v2 into modular functions CI:L0 Run doctests and unit tests
#1511 opened Nov 12, 2025 by hemildesai Draft
4 tasks
feat: Automodel init for DTensorPolicyV2 CI:L2 Run doctests, unit tests, functional tests, and convergence tests
#1509 opened Nov 12, 2025 by adil-a Loading…
feat: add llama3.3 nemotron super 49b recipes CI:L1 Run doctests, unit tests, and functional tests documentation Improvements or additions to documentation
#1506 opened Nov 11, 2025 by yuki-97 Loading…
build: Use dynamic engine for generate.
#1502 opened Nov 11, 2025 by shanmugamr1992 Loading…
4 tasks
feat: pipeline-rl style # of inflight prompt regulation CI:L1 Run doctests, unit tests, and functional tests documentation Improvements or additions to documentation
#1499 opened Nov 10, 2025 by youngeunkwon0405 Loading…
4 tasks
feat: allow uv-less execution and fingerprint the environment CI:L1 Run doctests, unit tests, and functional tests CI Relating to CI documentation Improvements or additions to documentation
#1491 opened Nov 9, 2025 by terrykong Loading…
fix: Megatron static inference and adapt to mcore engine API changes CI:L1 Run doctests, unit tests, and functional tests r0.4.0
#1488 opened Nov 7, 2025 by shanmugamr1992 Loading…
4 tasks
feat: Add AceMathRL recipe
#1484 opened Nov 6, 2025 by ffrujeri Draft
4 tasks
feat: fp16 for DTensor policies
#1474 opened Nov 5, 2025 by adil-a Loading…
Mmanohara/merge grpo helpsteer cp tp community-request
#1472 opened Nov 4, 2025 by nv-mmanohara Loading…
4 tasks
feat: DTensorPolicyV2 GPT-OSS support CI:L0 Run doctests and unit tests
#1470 opened Nov 4, 2025 by adil-a Loading…
build: Ensure automodel has deepep and TE
#1456 opened Oct 31, 2025 by chtruong814 Loading…
4 tasks
feat: Random dataset with specified input and output sequence length CI:L0 Run doctests and unit tests
#1453 opened Oct 31, 2025 by guyueh1 Loading…
4 tasks
feat: Add GPT-OSS support via mcore
#1452 opened Oct 31, 2025 by ashors1 Draft
4 tasks
feat: [draft do not merge] Fp8 moe rollout CI:L0 Run doctests and unit tests documentation Improvements or additions to documentation
#1446 opened Oct 29, 2025 by guyueh1 Loading…
4 tasks
fix: add theoretical TFlops for H200 GPU CI:L0 Run doctests and unit tests
#1422 opened Oct 24, 2025 by roclark Loading…
4 tasks done
DRAFT: feat: Enable simulated user for multi-turn GRPO
#1412 opened Oct 22, 2025 by ahmadki Loading…
4 tasks
ProTip! no:milestone will show everything without a milestone.