
feat: Use openmathinstruct2 training in grpo math example#18

Merged
parthchadha merged 11 commits into main from pchadha/openmathinstruct2
Mar 21, 2025

@parthchadha
Contributor

What does this PR do?

Add a one line overview of what this PR aims to accomplish.

Changelog

  • Please update the CHANGELOG.md under next version with high level changes in this PR.

Usage

  • You can potentially add a usage example below
# Add a code snippet demonstrating how to use this 

Before your PR is "Ready for review"

Pre checks:

Checklist when contributing

  • TBD

Additional Information

  • Related to # (issue)

@parthchadha parthchadha requested review from SahilJain314 and terrykong and removed request for SahilJain314 March 21, 2025 09:14
@parthchadha parthchadha force-pushed the pchadha/openmathinstruct2 branch from e3a2e9a to c4f14b9 Compare March 21, 2025 22:14
@parthchadha parthchadha force-pushed the pchadha/openmathinstruct2 branch from c4f14b9 to dc02972 Compare March 21, 2025 22:25
SahilJain314 previously approved these changes Mar 21, 2025
parthchadha and others added 9 commits March 21, 2025 15:30
Signed-off-by: Parth Chadha <pchadha@nvidia.com>
Signed-off-by: Parth Chadha <pchadha@nvidia.com>
- flatten hyperparams for tb no longer errors for lists (was an issue for schedulers)
- the submission script now overlaps the head on the first worker (no longer needs extra node just for head)
- fixes the CI to handle weird permissions issues
- added sphinx build and doctest to CI
- added functional tests to CI
- nuked an old example
- added docs for functional tests
- --no-container-mount-home
- fix a unit test that expected cuda to skip
- allow running unit tests on slurm head node with no gpu
- add a hermetic script to run functional tests
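
The first item in the list above (flattening hyperparameters for TensorBoard without failing on lists) can be illustrated with a minimal sketch. This is not the repository's implementation: the function name `flatten_hparams`, the `.` separator, and the sample config are all hypothetical, chosen only to show one way list-valued entries (such as a list of schedulers) could be expanded into indexed scalar keys.

```python
from typing import Any


def flatten_hparams(cfg: dict[str, Any], prefix: str = "", sep: str = ".") -> dict[str, Any]:
    """Flatten a nested config into {dotted_key: leaf_value}.

    Hypothetical helper: lists and tuples are treated like dicts keyed by
    their index, so a list of scheduler configs no longer needs special
    handling and every leaf ends up as a plain value.
    """
    flat: dict[str, Any] = {}
    for key, value in cfg.items():
        name = f"{prefix}{sep}{key}" if prefix else str(key)
        # Re-express sequences as index-keyed dicts so one code path handles both.
        if isinstance(value, (list, tuple)):
            value = {str(i): item for i, item in enumerate(value)}
        if isinstance(value, dict):
            flat.update(flatten_hparams(value, name, sep))
        else:
            flat[name] = value
    return flat


# Made-up config with a list of schedulers, the case the commit message mentions.
config = {
    "lr": 1e-3,
    "scheduler": [
        {"name": "LinearLR", "kwargs": {"total_iters": 50}},
        {"name": "ConstantLR"},
    ],
}
flat = flatten_hparams(config)
```

Here `flat` contains keys such as `scheduler.0.name` and `scheduler.0.kwargs.total_iters`, each mapped to a scalar, which is the shape a hyperparameter logger typically expects.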

Signed-off-by: Terry Kong <terryk@nvidia.com>
Signed-off-by: Parth Chadha <pchadha@nvidia.com>
Signed-off-by: Sahil Jain <sahilj@nvidia.com>
Co-authored-by: Terry Kong <terrycurtiskong@gmail.com>
Signed-off-by: Parth Chadha <pchadha@nvidia.com>
Signed-off-by: Terry Kong <terryk@nvidia.com>
Signed-off-by: Parth Chadha <pchadha@nvidia.com>
Signed-off-by: Terry Kong <terryk@nvidia.com>
Signed-off-by: Parth Chadha <pchadha@nvidia.com>
Signed-off-by: Terry Kong <terryk@nvidia.com>
Signed-off-by: Parth Chadha <pchadha@nvidia.com>
…es (#20)

Signed-off-by: Parth Chadha <pchadha@nvidia.com>
Signed-off-by: Parth Chadha <pchadha@nvidia.com>
@parthchadha parthchadha force-pushed the pchadha/openmathinstruct2 branch from dc02972 to 2d0f943 Compare March 21, 2025 22:30
@github-actions github-actions bot added the documentation and CI labels and removed the documentation and CI labels Mar 21, 2025
…uct2

Signed-off-by: Parth Chadha <pchadha@nvidia.com>
@parthchadha parthchadha merged commit 06bc57d into main Mar 21, 2025
5 checks passed
@parthchadha parthchadha deleted the pchadha/openmathinstruct2 branch March 21, 2025 22:52
KiddoZhu pushed a commit that referenced this pull request May 6, 2025
Signed-off-by: Parth Chadha <pchadha@nvidia.com>
Signed-off-by: Terry Kong <terryk@nvidia.com>
Signed-off-by: Sahil Jain <sahilj@nvidia.com>
Co-authored-by: Terry Kong <terrycurtiskong@gmail.com>
Co-authored-by: Sahil Jain <48468750+SahilJain314@users.noreply.github.com>
3 participants