You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I've seen that there are commented lines flash-attention/csrc/flash_attn/flash_api.cpp in the codebase suggesting there have been attempts in the past at getting flash attention 2 working. This also suggests that there have been some significant barriers to this effort.
I'd like to try my hand at this task, but would really appreciate insights the authors or any other readers might have on this topic. What are the most significant obstacles? Is it the architecture-specific optimizations, dev time, tiling, or something else?
The text was updated successfully, but these errors were encountered:
Hi All,
I've seen that there are commented lines
flash-attention/csrc/flash_attn/flash_api.cpp
in the codebase suggesting there have been attempts in the past at getting flash attention 2 working. This also suggests that there have been some significant barriers to this effort.I'd like to try my hand at this task, but would really appreciate insights the authors or any other readers might have on this topic. What are the most significant obstacles? Is it the architecture-specific optimizations, dev time, tiling, or something else?
The text was updated successfully, but these errors were encountered: