NVIDIA NeMo-Aligner v0.1.0
Highlights
First open source release of NeMo-Aligner. Featuring:
- Support for the full Reinforcement Learning from Human Feedback (RLHF) pipeline, including SFT, Reward Model Training, and Reinforcement Learning
- Support for the SteerLM technique
- Support for Direct Preference Optimization
- Support for all Megatron Core GPT models such as LLAMA2 70B
Container
docker pull nvcr.io/ea-bignlp/ga-participants/nemofw-training:23.11
To get access:
- Sign up to get free and immediate access to the NVIDIA NeMo Framework container. If you don’t have an NVIDIA NGC account, you will be prompted to create one before proceeding.
- If you don’t have an NVIDIA NGC API key, sign in to NVIDIA NGC, select the organization/team
ea-bignlp/ga-participants
and click Generate API key. Save this key for the next step. If you already have a key, skip this step.
- On your machine, log in to nvcr.io using
docker login nvcr.io
Username: $oauthtoken
Password: <Your Saved NGC API Key>
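For scripted setups, the login and pull steps can be combined without an interactive prompt. The following is a minimal sketch, not an official procedure: it assumes the saved API key has been exported in an environment variable named NGC_API_KEY (a name chosen here for illustration), and the helper name ngc_login_and_pull is likewise hypothetical. It relies only on the standard --username and --password-stdin options of docker login.

```shell
#!/usr/bin/env bash
# Sketch: non-interactive login to nvcr.io followed by the image pull.
# NGC_API_KEY and ngc_login_and_pull are illustrative names (assumptions),
# not something the release notes define.

IMAGE="nvcr.io/ea-bignlp/ga-participants/nemofw-training:23.11"

ngc_login_and_pull() {
    # Refuse to continue if the key has not been provided.
    if [ -z "${NGC_API_KEY:-}" ]; then
        echo "NGC_API_KEY is not set" >&2
        return 1
    fi
    # The username is the literal string $oauthtoken, so it is single-quoted
    # to stop the shell from expanding it as a variable.
    printf '%s' "$NGC_API_KEY" \
        | docker login nvcr.io --username '$oauthtoken' --password-stdin \
        || return 1
    docker pull "$IMAGE"
}
```

After sourcing the snippet and exporting the key, calling ngc_login_and_pull performs both steps. Passing the key on standard input keeps it out of the shell history and the process list, which is why --password-stdin is used here instead of --password.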