-
Notifications
You must be signed in to change notification settings - Fork 3.1k
commit dislora #11122
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
commit dislora #11122
Conversation
|
|
||
| train_ds, dev_ds, test_ds = create_dataset(data_args, training_args) | ||
|
|
||
| train_dataset_size = None |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
为什么需要get datasize?
paddlenlp/trl/sft_trainer.py
Outdated
| if max_eval_iters > 0 and step >= max_eval_iters - 1: | ||
| break | ||
|
|
||
| def _calc_ortho_loss(self, model): |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
不要写在SFTTrainer里,peft/dislora目录下新起一个DisLoRATrainer
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
|
hi, @Pioneer-wxh
|
* commit dislora * add DisLoRATrainer * “readme文件修改” * Delete docs/zh/llm/benchmark/rl/README.md
Before submitting
testsfolder. If there are codecov issues, please add tests cases first.PR types
PR changes
Description