-
Notifications
You must be signed in to change notification settings - Fork 27k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Gemma capping #34282
Open
ArthurZucker
wants to merge
49
commits into
main
Choose a base branch
from
gemma-capping
base: main
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
+404
−532
Open
Gemma capping #34282
Changes from 13 commits
Commits
Show all changes
49 commits
Select commit
Hold shift + click to select a range
85d549a
softcapping
ArthurZucker eba5191
soft cap before the mask
ArthurZucker b9e4a54
style
ArthurZucker 514a839
...
ArthurZucker 7544feb
super nit
ArthurZucker be1b8c3
update
ArthurZucker 0e0511f
fixes
ArthurZucker 03ccc22
update
ArthurZucker bdda724
small issue with modular
ArthurZucker a2b6b12
fix modular imports
ArthurZucker 9365c1b
update
ArthurZucker 2108ee3
fixup
ArthurZucker 520120a
simplify a hell lot
ArthurZucker 314ed1f
simplify cleaning imports
ArthurZucker 8830473
finish fixing
ArthurZucker e4c19d7
update our design
ArthurZucker 7922210
nits
ArthurZucker fa1319d
Merge branch 'main' of github.com:huggingface/transformers into gemma…
ArthurZucker 43c68f6
use a deprecation cycle
ArthurZucker 1aec944
updates
ArthurZucker 93b53ef
Fix modular (recursive deps need to always be computed after merges!)
Cyrilvallez 6f3cabb
Merge branch 'gemma-capping' of github.com:huggingface/transformers i…
ArthurZucker a79c4a9
push
ArthurZucker 4c6d299
fix
ArthurZucker 607c45d
update
ArthurZucker 4598bba
fix modular order
Cyrilvallez 5727270
make fix-copies
ArthurZucker 198b4c4
updates
ArthurZucker 3d35151
update
ArthurZucker da050cd
?
ArthurZucker e02078c
don't compile for now
ArthurZucker 5861bbf
?
ArthurZucker 8c47da2
fix some stuff
ArthurZucker 09a88d9
donc!
ArthurZucker c06b530
fix copies
ArthurZucker 89e6f85
update
ArthurZucker 152e0b7
fixup
ArthurZucker 46d8fa7
Merge branch 'main' of github.com:huggingface/transformers into gemma…
ArthurZucker 006e869
?
ArthurZucker 159c65a
fix two tests
ArthurZucker 56ea5b9
fix?
ArthurZucker 4c3deb9
for now, don't use head info
ArthurZucker 9e3609d
eager when output attentoin and sdpa or flash as it's the simplest be…
ArthurZucker 21edaed
fix-copies
ArthurZucker b5d9819
revert sdpa check
ArthurZucker 5a3dade
Apply suggestions from code review
ArthurZucker faf433b
Merge branch 'main' of github.com:huggingface/transformers into gemma…
ArthurZucker 1da75e1
rebase, fix-copies and push
ArthurZucker aca9120
add a slow integration test
ArthurZucker File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Isn't it a bit misleading to use flex attn when we have
attn_implementation="sdpa"
? My concerns would beWdyt about making another attn implementation option for flex attn specifically? Not sure if this goes over the goal but control over the specific implementation is always appreciated.
Overall excited to see this, great work!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
SDPA version of
gemma
never "worked" TBH!I'll probably add a new class for flex attention, this was simpler for testing