-
Notifications
You must be signed in to change notification settings - Fork 346
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
[C/JAX] Support more mask types for the arbitrary seqlen kernels and …
…minor changes of JAX bias (#469) * Move bias to float32 Signed-off-by: Reese Wang <[email protected]> * Enable varlen Signed-off-by: Reese Wang <[email protected]> * Increase neg infinity abs values Signed-off-by: Reese Wang <[email protected]> * Enable varlen tests Signed-off-by: Reese Wang <[email protected]> * Remove unnecessary code Signed-off-by: Reese Wang <[email protected]> * Fix lint Signed-off-by: Reese Wang <[email protected]> * Support variable sequence length after cuDNN 8.9.6 Signed-off-by: Reese Wang <[email protected]> * Use unique_ptr instead of shared_ptr Signed-off-by: Reese Wang <[email protected]> * Add a new mask type: PADDING_CAUSAL_MASK Signed-off-by: Reese Wang <[email protected]> * Support flash padding mask after 8.9.6 Signed-off-by: Reese Wang <[email protected]> * Enhance the Max512 handling for causal masking and add the related tests Signed-off-by: Reese Wang <[email protected]> * Update the fused attn support lists Signed-off-by: Reese Wang <[email protected]> * Remove padding_aware from the caching Signed-off-by: Reese Wang <[email protected]> * Fix libtransformer.so issue Signed-off-by: Reese Wang <[email protected]> * Reduce the pad ratio tests Signed-off-by: Reese Wang <[email protected]> * Fix a bug with cuDNN 8.9.5 Signed-off-by: Reese Wang <[email protected]> * Release backend resource after the module level unit test Signed-off-by: Reese Wang <[email protected]> * Clean the jax live arrays before running the unit tests Signed-off-by: Reese Wang <[email protected]> * Fix too-few-public-methods lint Signed-off-by: Reese Wang <[email protected]> --------- Signed-off-by: Reese Wang <[email protected]>
- Loading branch information
Showing
13 changed files
with
500 additions
and
166 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.