refactor: refactor env and data processor & add nemotron super 49b recipes #1506
base: main
Signed-off-by: Yuki Huang <[email protected]> Signed-off-by: ruit <[email protected]>
Signed-off-by: ruit <[email protected]>
… processors. Added `raw_dataset.py` and `path.py` for improved dataset processing. Updated project-includes in `pyrefly.toml` and modified `grpo.md` to reflect new task-dataset mapping. Cleaned up unused code and configurations in various YAML files.
Signed-off-by: ruit <[email protected]>
…or handling
- Introduced documentation for the new Code Jaccard Environment, detailing its functionality, usage, and configuration.
- Updated RawDataset class to provide a default processor if none is specified in the data configuration.
- Enhanced test coverage for the helpsteer3 data processor to ensure correct functionality and output.
Signed-off-by: ruit <[email protected]>
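As a rough illustration of the default-processor fallback mentioned in the commit above, here is a minimal sketch. It is not the PR's code: only the `RawDataset` class name comes from the commit message, while the `"processor"` config key, `default_processor`, `resolve_processor`, and `ExampleDataset` are hypothetical names chosen for illustration.

```python
from typing import Any, Callable, Optional

Processor = Callable[[dict[str, Any]], dict[str, Any]]


def default_processor(example: dict[str, Any]) -> dict[str, Any]:
    """Hypothetical fallback: return a shallow copy of the example unchanged."""
    return dict(example)


class RawDataset:
    """Sketch of a base class that resolves a processor from its data config."""

    def __init__(self, data_config: Optional[dict[str, Any]] = None) -> None:
        self.data_config = data_config or {}
        self.processor: Processor = self.resolve_processor()

    def resolve_processor(self) -> Processor:
        # "processor" is an assumed key name; the real config schema may differ.
        processor = self.data_config.get("processor")
        return processor if processor is not None else default_processor


class ExampleDataset(RawDataset):
    """Hypothetical subclass; it inherits the fallback without extra code."""
```

With this shape, a dataset built without a processor entry in its config still ends up with a callable processor, so downstream code does not need a `None` check.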
- Updated CLEVRCoGenTDataset, OpenAIFormatDataset, and SquadDataset to inherit from the RawDataset class for improved dataset handling.
- Added necessary imports for RawDataset in the respective files.
Signed-off-by: ruit <[email protected]>
…up for vlm grpo
- Added `env_name` to `vlm_grpo_3B_megatron.yaml` and `vlm_grpo_3B.yaml` for environment specification.
- Modified `setup_data` function in `run_vlm_grpo.py` to use `env_name` for environment configuration, enhancing flexibility in dataset processing.
Signed-off-by: ruit <[email protected]>
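The `env_name` change can be pictured with the sketch below, which assumes a registry that maps names to environment factories. The registry, `register_env`, `make_example_env`, `build_env`, and the nested `"env"` config layout are all hypothetical; the actual `setup_data` in `run_vlm_grpo.py` may resolve the environment differently.

```python
from typing import Any, Callable

# Hypothetical registry: environment name -> factory taking that env's config.
ENV_REGISTRY: dict[str, Callable[[dict[str, Any]], Any]] = {}


def register_env(name: str):
    """Decorator that records a factory under the given environment name."""
    def decorator(factory: Callable[[dict[str, Any]], Any]):
        ENV_REGISTRY[name] = factory
        return factory
    return decorator


@register_env("example_env")
def make_example_env(env_config: dict[str, Any]) -> dict[str, Any]:
    # Stand-in for constructing a real environment object.
    return {"kind": "example_env", "config": env_config}


def build_env(config: dict[str, Any]) -> Any:
    """Select and build the environment named by config["env_name"]."""
    env_name = config["env_name"]
    if env_name not in ENV_REGISTRY:
        raise KeyError(
            f"Unknown env_name {env_name!r}; known: {sorted(ENV_REGISTRY)}"
        )
    # Assumed layout: per-environment settings live under config["env"][env_name].
    env_config = config.get("env", {}).get(env_name, {})
    return ENV_REGISTRY[env_name](env_config)
```

Driving the lookup from a single config field keeps the run script free of hard-coded environment choices, which appears to be the flexibility the commit message refers to.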
Follow up of #1472. Thanks @nv-mmanohara for adding this!
`run_grpo.py`, will replace `run_grpo_math.py` and `run_grpo_rm.py` in a subsequent PR.
Summary by CodeRabbit
New Features
Documentation
Tests