Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

add transfer modules #40

Open
wants to merge 17 commits into
base: main
Choose a base branch
from
Open

add transfer modules #40

wants to merge 17 commits into from

Conversation

hy395
Copy link

@hy395 hy395 commented Sep 4, 2024

Description of your changes

  • add hound_transfer.py:

    • main script for running transfer;
  • add transfer_helper.py:

    • helper functions to insert adapter layers to existing architecture.
    • helper functions to merge adapters.
  • layers.py:

    • added IA3, IA3_ff, Lora, Locon, AdapterHoulsby class definitions;
    • add seqlen_train argument to allow model to take any input seq length;
    • add additional arguments to SqueezeExcite class;
  • blocks.py:

    • add options to specify adapter in json file,
    • add additional arguments to SqueezeExcite.
  • seqnn.py:

    • add methods to insert adapter;
  • hound_eval.py:

    • optionally use fl16;
  • hound_eval_spec.py:

    • optionally use fl16;
  • trainer.py:

    • add option to do loss scaling;
    • add function to print GPU memory usage when training;

evaluation code:

  • add borzoi_test_genes.py
  • add pygene.py to support borzoi_test_genes.py

test:

  • code to test inserting ia3; see if added trainable variables match expectation.

Issue ticket number and link

Type of change

  • Bug fix
  • New feature
    • Backwards Incompatible?
  • Refactoring / code clean-up
  • Documentation add / update
  • Automated Test
  • Other (please specify)

(If applicable) How has this been tested?

see if trainable variables match expectation.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant