Pass to block dynamic dimensions of operands of iree_linalg_ext.attention. #18874

Open

wants to merge 2 commits into base: main

Commits on Oct 23, 2024

  1. Allow dynamic dimensions during folding of `tensor.expand_shape/collapse_shape` into `flow.dispatch.tensor.load/store`.
    
    This also cleans up the implementation of these patterns, avoiding
    templated code that is hard to read and maintain. (See the first
    sketch after the commit list for a before/after example of the
    folding.)
    
    Signed-off-by: MaheshRavishankar <[email protected]>
    MaheshRavishankar committed Oct 23, 2024
    2d64ab1
  2. Pass to block dynamic dimensions of operands of `iree_linalg_ext.attention`.
    
    `IntegerRangeAnalysis` and `IntegerDivisibilityAnalysis` provide
    range and divisibility information for constants passed to the
    dispatch. This can be used to infer range and divisibility
    information for the dynamic dimensions of all tensor values in the
    dispatch. This PR adds an analysis to do this.
    
    This analysis is then used to expand the dimensions of operands of
    the attention operation that are dynamic but known to be divisible
    by a compile-time static value (see the second sketch after the
    commit list). This puts the operation into a form that the AMDGPU
    backend can compile to target the mfma intrinsics.
    
    Signed-off-by: MaheshRavishankar <[email protected]>
    MaheshRavishankar committed Oct 23, 2024
    949f383
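
Below is a minimal, hypothetical MLIR sketch of the folding enabled by the first commit. The op syntax follows upstream MLIR and IREE, but the value names (`%src`, `%src_expanded`), shapes, and the factor 32 are illustrative assumptions, not taken from the PR.

```mlir
// Before: a dynamically sized flow.dispatch.tensor.load feeding a
// tensor.expand_shape. %sz is the flat dynamic size and %d the outer
// expanded size, with %sz == %d * 32 by construction.
%0 = flow.dispatch.tensor.load %src, offsets = [0], sizes = [%sz], strides = [1]
    : !flow.dispatch.tensor<readonly:tensor<?xf32>>{%sz} -> tensor<?xf32>
%1 = tensor.expand_shape %0 [[0, 1]] output_shape [%d, 32]
    : tensor<?xf32> into tensor<?x32xf32>

// After: the expand_shape is folded into the load, which now reads
// directly at the expanded shape even though %d is dynamic.
%1 = flow.dispatch.tensor.load %src_expanded, offsets = [0, 0], sizes = [%d, 32], strides = [1, 1]
    : !flow.dispatch.tensor<readonly:tensor<?x32xf32>>{%d} -> tensor<?x32xf32>
```

Folding retypes the source binding, shown here as the separate value `%src_expanded`; the dynamic sizes involved are presumably what the pre-existing patterns rejected and this commit now allows.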
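Similarly, a hedged sketch of what the new pass might produce for an attention operand. The factor 16 and the names `%q`, `%m`, `%k0` are illustrative assumptions, and how `iree_linalg_ext.attention` itself consumes the blocked operand is omitted.

```mlir
// %q is, say, the query operand of an iree_linalg_ext.attention op,
// with shape M x K0 where M is dynamic. The analysis has proven that M
// is divisible by 16 (e.g. every constant reaching the dispatch is a
// multiple of 16).
%c16 = arith.constant 16 : index
%m_outer = arith.divui %m, %c16 : index

// Block the dynamic M dimension into (M/16) x 16. The inner dimension
// is now a compile-time constant, giving the AMDGPU backend a static
// tile it can map onto mfma intrinsics.
%q_blocked = tensor.expand_shape %q [[0, 1], [2]] output_shape [%m_outer, 16, %k0]
    : tensor<?x?xf16> into tensor<?x16x?xf16>
```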