Reapply "[Layouts] Propagate layouts into conditionals (#5610)" #5725

Mogball · 2025-01-28T06:35:05Z

This is a re-land of #5610 with a fix to DotOperandEncodingAttr to resolve a crash exposed in internal tests.

At the same time, this slightly alters the implementation of the algorithm to ensure that layouts don't get pushed back out of conditionals by the forward propagation pass. Originally, the goal of hoisting across conditionals was to ensure that cvts in fused inner loops are placed inside the prologue before pipelining peels the prologue and introduces a chain of dependencies.

My guess is that @lezcano's changes to strengthen layout propagation enabled the forward propagation pass to push the conversion back out of the prologue even after pipelining (which is good), but I explicitly disabled hoisting into chains of conditionals, so these didn't balance out. This PR alters the algorithm to consider hoisting cvts across chains of conditionals only for subslices inside for loops.

This reverts commit 98b40d5.

Mogball · 2025-01-28T06:35:45Z

lib/Dialect/TritonGPU/IR/Dialect.cpp

+  if (auto blocked = mlir::dyn_cast<BlockedEncodingAttr>(parent)) {
+    auto shapePerCTA =
+        expandMatrixShapeWithBatch(ArrayRef(getShapePerCTA(*this, shape)));
+    auto shapePerCTATile =


@lezcano @Jokeren I have no idea if this logic is correct. Would appreciate if one of you two could check this for me :)

Even better, just create the associated LinearLayout, with it create a LinearLayoutEncoding and call getElemsPerThread

Seems legit

Seems fine to me as the LinearLayout object can be cached.

This appears to be causing a unit test to fail. Digging in...

Sometimes the unit test itself can be wrong

Here's the failure. It seems that going through LinearEncodingAttr here is causing extra 0 bases to be added to the register input dimension (this test is Slice(Dot) to LL; incidentally, it's Slice(Dot(Blocked)) that's actually crashing prior to this PR):

/Users/jeffniu/code/triton/unittest/Dialect/TritonGPU/LinearLayoutConversionsTest.cpp:824: Failure Expected equality of these values: toLinearLayout({16}, sliceV2) Which is: - register=1 -> (8) register=2 -> (0) register=4 -> (0) register=8 -> (0) register=16 -> (0) - lane=1 -> (0) lane=2 -> (0) lane=4 -> (1) lane=8 -> (2) lane=16 -> (4) - warp is a size 1 dimension - block is a size 1 dimension where out dims are: [dim0 (size 16)] LinearLayout( { {S("register"), {{8}}}, {S("lane"), {{0}, {0}, {1}, {2}, {4}}}, {S("warp"), {}}, {S("block"), {}}, }, {S("dim0")}) Which is: - register=1 -> (8) - lane=1 -> (0) lane=2 -> (0) lane=4 -> (1) lane=8 -> (2) lane=16 -> (4) - warp is a size 1 dimension - block is a size 1 dimension where out dims are: [dim0 (size 16)] /Users/jeffniu/code/triton/unittest/Dialect/TritonGPU/LinearLayoutConversionsTest.cpp:839: Failure Expected equality of these values: toLinearLayout({16}, sliceV3) Which is: - register=1 -> (1) register=2 -> (8) register=4 -> (0) - lane=1 -> (2) lane=2 -> (4) lane=4 -> (0) lane=8 -> (0) lane=16 -> (0) - warp=1 -> (0) warp=2 -> (0) - block is a size 1 dimension where out dims are: [dim0 (size 16)] LinearLayout( { {S("register"), {{1}, {8}}}, {S("lane"), {{2}, {4}, {0}, {0}, {0}}}, {S("warp"), {{0}, {0}}}, {S("block"), {}}, }, {S("dim0")}) Which is: - register=1 -> (1) register=2 -> (8) - lane=1 -> (2) lane=2 -> (4) lane=4 -> (0) lane=8 -> (0) lane=16 -> (0) - warp=1 -> (0) warp=2 -> (0) - block is a size 1 dimension where out dims are: [dim0 (size 16)]

@lezcano Is this expected? Should I update the unit tests?

Ah it seems there is some hack in the SliceEncodingAttr conversion to LinearLayout about these registers...

I spent some time digging into this and it does not appear to be trivial to untangle. This requires removing the hack that was adding for SliceEncodingAttr::toLinearLayout, since there is effectively a circular dependency between these functions, but removing that hack is definitely a rabbithole. I will try to stick with this independent logic for now, but given how strange this hack is, it's probably something we should spend time looking at in the near future.

This reverts commit b3bf2b2.

Mogball added 3 commits January 27, 2025 15:04

Reapply "[Layouts] Propagate layouts into conditionals (#5610)"

6053a37

This reverts commit 98b40d5.

implement getElemsPerThread for dot encoding of blocked

26da6a7

fix propagation in and out of loops

60f8c5f

Mogball requested a review from ptillet as a code owner January 28, 2025 06:35

Mogball requested review from ThomasRaoux and Jokeren January 28, 2025 06:35

Mogball commented Jan 28, 2025

View reviewed changes

Mogball requested a review from lezcano January 28, 2025 06:39

Mogball added 2 commits January 28, 2025 11:05

just use linear layouts ^TM

b3bf2b2

Revert "just use linear layouts ^TM"

453f578

This reverts commit b3bf2b2.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reapply "[Layouts] Propagate layouts into conditionals (#5610)" #5725

Reapply "[Layouts] Propagate layouts into conditionals (#5610)" #5725

Mogball commented Jan 28, 2025 •

edited

Loading

Mogball Jan 28, 2025

lezcano Jan 28, 2025 •

edited

Loading

Mogball Jan 28, 2025

Jokeren Jan 28, 2025

Mogball Jan 28, 2025

Jokeren Jan 28, 2025

Mogball Jan 28, 2025 •

edited

Loading

Mogball Jan 28, 2025

Mogball Jan 30, 2025

Reapply "[Layouts] Propagate layouts into conditionals (#5610)" #5725

Are you sure you want to change the base?

Reapply "[Layouts] Propagate layouts into conditionals (#5610)" #5725

Conversation

Mogball commented Jan 28, 2025 • edited Loading

Mogball Jan 28, 2025

Choose a reason for hiding this comment

lezcano Jan 28, 2025 • edited Loading

Choose a reason for hiding this comment

Mogball Jan 28, 2025

Choose a reason for hiding this comment

Jokeren Jan 28, 2025

Choose a reason for hiding this comment

Mogball Jan 28, 2025

Choose a reason for hiding this comment

Jokeren Jan 28, 2025

Choose a reason for hiding this comment

Mogball Jan 28, 2025 • edited Loading

Choose a reason for hiding this comment

Mogball Jan 28, 2025

Choose a reason for hiding this comment

Mogball Jan 30, 2025

Choose a reason for hiding this comment

Mogball commented Jan 28, 2025 •

edited

Loading

lezcano Jan 28, 2025 •

edited

Loading

Mogball Jan 28, 2025 •

edited

Loading