Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Miscellaneous fixes for core attention (#344)
* miscellaneous fixes
  Signed-off-by: Charlene Yang <[email protected]>
* add back pytorch csrc extensions.h
  Signed-off-by: Charlene Yang <[email protected]>
* add unit tests for dpa checkpointing
  Signed-off-by: Charlene Yang <[email protected]>
* remove seqlen%32/64 checks for now
  Signed-off-by: Charlene Yang <[email protected]>
* fix tests for core attn bias
  Signed-off-by: Charlene Yang <[email protected]>
* add tests for changes regarding rng_state in aux_ctx_tensor
  Signed-off-by: Charlene Yang <[email protected]>
* reuse rng tracker from numerics in fused attn; skip checkpointing if FAv2 in numerics
  Signed-off-by: Charlene Yang <[email protected]>
* uncomment comments used for testing
  Signed-off-by: Charlene Yang <[email protected]>
* fix pre/post scale bias
  Signed-off-by: Charlene Yang <[email protected]>
* Update transformer_engine/pytorch/attention.py
  Co-authored-by: Tim Moon <[email protected]>
  Signed-off-by: cyanguwa <[email protected]>
* remove skipifs for FAv2 check after PR366
  Signed-off-by: Charlene Yang <[email protected]>
* remove checkpointing tests for transformer layer; dpa tests still provide coverage
  Signed-off-by: Charlene Yang <[email protected]>
* adjust random number range for tests
  Signed-off-by: Charlene Yang <[email protected]>
* Add upper bound to FA version
  Signed-off-by: Kirthi Shankar Sivamani <[email protected]>
* Check backend only when using FusedAttention
  Signed-off-by: Kirthi Shankar Sivamani <[email protected]>
* remove imports/variables related to FAv2 checks
  Signed-off-by: Charlene Yang <[email protected]>
* further fix random number ranges for tests
  Signed-off-by: Charlene Yang <[email protected]>
* fix variable referenced before assignment error
  Signed-off-by: Charlene Yang <[email protected]>

---------

Signed-off-by: Charlene Yang <[email protected]>
Signed-off-by: cyanguwa <[email protected]>
Signed-off-by: Kirthi Shankar Sivamani <[email protected]>
Co-authored-by: Tim Moon <[email protected]>
Co-authored-by: Kirthi Shankar Sivamani <[email protected]>
1 parent a0f4435, commit cbfb8c6
Showing 8 changed files with 208 additions and 111 deletions.