Improve Hugging Face SFT Script #539
Let's check whether
There seem to be deprecated args in data preparation, which will require a rewrite.
Can you help with that?
Yes. Could you please help find the PRs/diffs for the SFT Trainer, as you did previously here? That would be quite helpful :)
You could use git blame and the commit history on those module files to find the relevant changes.
Fixes #487
I've removed the deprecated parameter, as previously discussed in the issue. The sequence length for the training dataset can instead be specified using the HF datasets library, as mentioned here.
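For illustration, here is a minimal sketch of how the sequence length could be capped on the dataset side. It assumes the examples are already tokenized into list-valued columns such as `input_ids` and `attention_mask`; `truncate_batch` and `MAX_LENGTH` are hypothetical names used only for this example and are not part of the PR.

```python
from typing import Dict, List

MAX_LENGTH = 8  # hypothetical cap; choose it to fit the model's context window


def truncate_batch(
    batch: Dict[str, List[List[int]]], max_length: int = MAX_LENGTH
) -> Dict[str, List[List[int]]]:
    """Truncate every sequence in every column of the batch to max_length tokens."""
    return {
        name: [seq[:max_length] for seq in column]
        for name, column in batch.items()
    }


# Plain-dict demo using the column -> list-of-sequences layout of a batched map call.
batch = {
    "input_ids": [[1, 2, 3], list(range(20))],
    "attention_mask": [[1, 1, 1], [1] * 20],
}
truncated = truncate_batch(batch, max_length=MAX_LENGTH)
print([len(ids) for ids in truncated["input_ids"]])  # [3, 8]
```

With the datasets library, a function of this shape could be applied as `tokenized_dataset.map(truncate_batch, batched=True, fn_kwargs={"max_length": MAX_LENGTH})`, where `tokenized_dataset` is a hypothetical, already-tokenized `datasets.Dataset` whose columns all hold token lists.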
Please let me know if any further rectification is required and I will make the necessary changes.
cc: @Tcc0403