I use FlashAttention 1.0.9, which supports Turing GPUs (2080 Ti), but I get this error:
RuntimeError: FlashAttention backward for head dim > 64 requires A100 or H100 GPUs as the implementation needs a large amount of shared memory.
The main constraint is the size of shared memory. As the error message says, the backward pass for head dim > 64 requires an A100 or H100. The forward pass for head dim <= 128, and the backward pass for head dim <= 64, work on other GPUs.
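In practice you can guard against this at run time: check the head dimension and the GPU's compute capability before training, and fall back to a plain PyTorch attention implementation when FlashAttention's backward is not supported. The sketch below is illustrative and not part of the flash-attn API; the helper names and the exact capability check are assumptions based on the constraints quoted above.

```python
# Minimal sketch (assumptions, not library code): decide whether FlashAttention 1.x's
# backward pass can be used, otherwise fall back to standard PyTorch attention.
import math
import torch

def flash_backward_supported(head_dim: int) -> bool:
    """Heuristic based on the constraints above: backward for head_dim <= 64 works
    on supported GPUs generally; head_dim > 64 needs the extra shared memory of
    an A100 or H100. Assumption: A100 reports compute capability (8, 0) and
    H100 reports (9, 0)."""
    major, minor = torch.cuda.get_device_capability()
    if head_dim <= 64:
        return True
    return (major, minor) in {(8, 0), (9, 0)}

def reference_attention(q, k, v):
    """Plain PyTorch attention used as a fallback.
    q, k, v: (batch, heads, seq_len, head_dim)."""
    scale = 1.0 / math.sqrt(q.size(-1))
    scores = torch.matmul(q, k.transpose(-2, -1)) * scale
    return torch.matmul(scores.softmax(dim=-1), v)
```

On a 2080 Ti with head_dim = 128, `flash_backward_supported(128)` returns False, so training code would route through the fallback (or you can reduce the head dimension to 64 or less to keep using FlashAttention).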