Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: Added training scripts Independent of Hugging Face's Trainer #27

Open
wants to merge 50 commits into
base: main
Choose a base branch
from

Conversation

chizuchizu
Copy link

@chizuchizu chizuchizu commented Nov 27, 2023

Added training scripts Independent of Hugging Face's Trainer

Description

Purpose of the PR

  • Added training scripts that don't depend on Hugging Face's Trainer class.

Related Issue(s)

  • Nothing.

Changes

  • Added train_ds.py
  • Added utils scripts imported from Microsoft VisualChat.
  • Added the training config and the training shell script.
  • Update documentation on training and inference methods.

Model/Algorithm Performance

  • Trained the GIT model following projects/opt/exp002_ds.yml with both ZeRO-2 and ZeRO-3, confirming valid inference.
  • Trained Video BLIP with ZeRO-3.
  • Observed an increase in GPU memory consumption compared to when using Trainer.

Dependencies

  • Nothing.

Reviewer Notes

Confirmed

  • I have updated the documentation accordingly.
  • I have adhered to the coding standards and guidelines of this project.
  • I have added comments, especially in hard-to-understand areas.

@chizuchizu chizuchizu self-assigned this Nov 27, 2023
@chizuchizu chizuchizu marked this pull request as ready for review November 27, 2023 10:14
@chizuchizu chizuchizu changed the title feat: Add training scripts Independent of Hugging Face's Trainer feat: Added training scripts Independent of Hugging Face's Trainer Nov 27, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant