Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Convert the checkpoint
Step 1: download an olmo checkpoint (e.g,. from Luca)
Step 2: install the custom branch:
pip install git+https://github.com/vwxyzjn/transformers.git@olmo1124_classification
Step 3: run the converter (replace
exp_name
with huggingface revision you want to use)Run SFT / DPO training
To make our codebase / image work with
Olmo1124ForCausalLM
, we just need to runpip install git+https://github.com/vwxyzjn/transformers.git@olmo1124_classification
before running the SFT or DPO commands like this:You can also launch on augusta using the following command (augusta only works with
costah/open_instruct_ppo_ray_ninja
image):My
configs/train_configs/sft/tulu3_8b_preview_mix_v3.9.yaml
yaml looks like