Replies: 1 comment
-
Have you tried with the latest version of triton ? By reproducing the tutorial and changing the shapes as above, I have:
as expected. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
In the Matrix Multiplication tutorial, there is a comment and an assertion that claim if
K % BLOCK_SIZE_K != 0
, there would be out-of-bound memory access (which makes sense).But if I change the input matrices to shape (512, 513) and (513, 512) (i.e. K=513), with BLOCK_SIZE_K=32, the code still works and result is close to cublas result. What's going on?
Beta Was this translation helpful? Give feedback.
All reactions