Meaning of the num_stages variable. #512
-
Hi 👋 Is there any reference someone can point me towards about what exactly the number of stages means? Shared memory requirements appears to increase linearly with number of stages, so I guess it's some kind of buffering to increase memory bandwidth but if there's some more documentation about it it'd be helpful. Thanks |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 2 replies
-
Hello! The Triton compiler pipelines some loops using the new asynchronous copy instruction on A100. |
Beta Was this translation helpful? Give feedback.
Hello! The Triton compiler pipelines some loops using the new asynchronous copy instruction on A100.
num_stages
refers to the depth of this pipeline. Because data from the next iteration(s) is asynchronously prefetched into shared memory, the shared memory requirements of the kernel increases linearly