Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Question] Converting a Megatron-LM ckpt to nemo so we can use NeMo-Aligner for post-training #340

Open
abgoswam opened this issue Oct 10, 2024 · 0 comments

Comments

@abgoswam
Copy link

abgoswam commented Oct 10, 2024

Following the GPT Pretraining section in the Megatron-LM repo, we are able to successfully train a model using Megatron-LM

  • For post-training, we want to use NeMo-Aligner.
  • But the instructions refer to a .nemo model.

I saw pointers on how to convert from HF to nemo. example convert_llama_hf_to_nemo.py

However I did not get any examples of converting a ckpt saved using the Megatron-LM to the .nemo format . Any pointers on this ?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant