-
Couldn't load subscription status.
- Fork 89
Closed
Description
Hi @Zhihan1996,
You might also be interested in this PR which updates MosaicBERT to FlashAttention 2 and removes all the issues with the custom Triton FlashAttention + ALiBi implementation.
Metadata
Metadata
Assignees
Labels
No labels