-
Notifications
You must be signed in to change notification settings - Fork 15
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
* feat(llama): import RoPE scaling code This is imported from the original Llama reference implementation: https://github.com/meta-llama/llama-models/blob/7890266c5a3ccd29e739d53a71ea968bcf4ca400/models/llama3/reference_impl/model.py#L45 Note that the function does not have any effect on the original model code as long as the use_scaled parameter is false (the default). * feat(llama): add RopeScalingArgs These are aligned with HF ones, so it will be easier to implement rope scaling as it is done in Llama3.1. * feat(llama): support rope scaling arguments to improve flexibility * chore: relax safetensors pattern on download * feat: untie weights when needed (i.e.: Llama3.2-1B) * feat: add support for Llama3.1 - 3.2 and 3.3 models
- Loading branch information
1 parent
08e4977
commit bb174b6
Showing
5 changed files
with
126 additions
and
5 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters