Skip to content

Latest commit

 

History

History
49 lines (45 loc) · 1.4 KB

train_ppo_llama_with_reward_fn.sh

File metadata and controls

49 lines (45 loc) · 1.4 KB